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SIGNIFICANCE  ftfjp  EXPLANATION 


In  this  survey  paper  we  summarize  the  recent  progress  on  the  problem  of 
analytic  similarity  of  matrices.  That  is,  given  n  x  n  complex  values 

matrices  A(x)  =  (aj.j(x)),  B(x)  =  (hij(x)l,  whose  entries  aij(xl,  bj^(x), 
i,j  =  1,...,n  are  analytic  functions  in  some  domain  H,  when 
B(x)  =  X(x)A(x)X-1 (x) ,  where  X(x)  and  X“1(x)  are  analytic  in  x?  what  is 
the  canonical  form  of  A  under  the  analytic  similarity?  These  problems  are 
related  closely  to  the  study  of  systems  of  ordinary  differential  equations 
having  singularities  either  in  time  or  a  parameter.  To  maVe  this  survey  paper 
to  be  self  contained  we  had  to  recall  some  basic  facts  in  theory  of  rinos, 
functions  of  one  and  several  compplex  variables.  Also  we  did  repeat  and 
extend  some  basic  facts  in  theory  of  matrices  in  order  to  aoolv  them  for  the 
analytic  similarity  problem  and  the  related  Questions. 
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Introduction 
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Matrix  theory  is  constantly  gaining  popularity  in  pure  and  applied  mathe¬ 
matics  and  as  well  as  in  other  branches  of  sciences.  Perhaps  the  best  book  on 
the  subject  is  the  well  known  book  of  Gantmacher  -  "The  Theory  of  Matrices"  which 
was  printed  in  the  beginning  of  the  fifties  in  U.S.S.R.  .  Since  then  the  litera¬ 
ture  on  the  subject  expanded  enormously.  There  are  recent  books  on  the  subject 
e.g.,  Berman-Plemmons  [1979]  which  usually  treat  special  topics  in  theory  of 
matrices,  it  is  my  personal  belief  that  the  time  is  ripe  for  writing  a  compre¬ 
hensive  treatise  on  the  main  developments  in  theory  of  matrices. 

I  decided  to  write  a  series  of  c.urvey  reports  on  the  most  attractive  and 
important  subjects  in  theory  of  »a.  as  -  Spectral  Theory.  This  paper  is  the 
first  one  in  the  series.  It  deals  with  general  types  of  matrices.  The  climax 
of  this  paper  are  Section  1.29  -  1.36  which  are  dealing  with  the  concept  of 
analytic  similarity  of  matrices.  This  subject  arises  naturally  in  the  study  of 
ordinary  differential  equations  having  singularities  either  in  time  or  a  parameter. 
See  for  example  Wasow  [1963] ,  11977)  ,  [1978]  and  the  references  therein.  As  the 
reader  can  see,  the  subject  of  analytic  similarities  of  matrices  is  far  from  being 
completed.  The  main  reason  for  the  difficulties  in  this  problem  is  the  non¬ 
existence  of  a  simple  canonical  form.  Clearly  the  topic  of  the  analytic  similarity 
of  matrices  is  a  part  of  a  more  general  algebraic  problem  of  similarity  of  matrices 
over  the  integral  domains.  That  is  the  reason  I  started  the  book  with  several 
sections  on  rings,  domains  and  fields  and  their  properties. 

I  tried  to  make  this  paper  (and  the  following  ones)  to  be  self  contained  as 
much  as  possible  from  the  matrix  point  of  view.  However,  in  dealing  with  some 
problems  in  matrix  theory  one  needs  to  use  various  kinds  of  techniques  -  theory 
of  functions  of  one  and  several  complex  variables,  methods  of  algebraic  geometry 
and  non-linear  analysis.  Whenever  these  tools  are  used  the  reader  is  referred 
to  appropriate  references.  The  basic  knowledge  for  this  paper  are  basic  results 


in  matrix  theory  (e.g.,  a  few  first  chapters  in  Gantmacher  [19591)  and  basic 
knowledge  in  function  of  one  complex  variable  (e.g.,  Rudin  [1974]).  Since  I 
tried  to  make  these  papers  self  contained  from  the  matrix  point  of  view,  I  did 
repeat  some  standard  facts  in  theory  of  matrices  as  the  Jordan  canonical  form. 
In  that  case  I  tried  to  make  the  exposition  short  and  concise.  Note  that  the 
problems  appearing  in  the  end  of  each  section  are  an  integral  part  of  the  paper 
and  sometime  they  are  used  in  the  main  text.  Finally  let  me  apologize  to  those 
authors  whose  results  were  not  mentioned  or  improperly  cited. 

Hie  four  other  papers  in  this  survey  series  are  planned  to  be  as  follows: 

2.  Cones,  convex  sets  and  norms, 

3.  Nonnegative  matrices, 

4.  Symmetric  and  hermitian  matrices, 

5.  Inverse  eigenvalue  problems. 

Also  these  reports  will  be  eventually  collected  to  a  book. 


D  -  an  integral  domain. 

F  -  a  field,  sometimes  the  division  field  of  D. 

C  -  complex  numbers. 

R  -  real  numbers. 

Z  -  integers. 

Z+  -  non-negative  integers. 

BD  -  Bezout  domain. 

GCCD  -  greatest  common  divisor  domain. 

FDD  -  elementary  divisor  domain. 

OFD  -  unique  factorization  domain. 

FID  -  principal  ideal  domain. 

I  -  ideal  in  D. 

d"  -  a  set  of  column  vectors  with  n  coordinates  in  D. 

S2  —  a  set  of  points  in  c”. 

H{£2)  -  the  class  of  all  analytic  functions  in  ft. 

Hi  -  HfS3)  for  SI  “  {?}  c  c" . 

M(fi)  -  the  quotient  field  of  H(S2),  for  connected  sets  H. 

D[x1,...,xnl  -  the  ring  of  all  polynomials  in  n  variables  with  the  coeffients  in  D 
a|b  -  a  divides  b. 

(a^>...,a]c)  -  the  greatest  common  divisor  of  a1,...,ak. 

II  -  a  D-module. 

t  k  Ik 

tx  ,...,x  ]  -  a  D-module  generated  by  the  elements  x  , ...,x  CM. 

dim  M  -  the  dimension  of  a  D-module. 

Hom(M,H)  -  the  set  of  homomorphismes  T  :  M  ♦  H. 

V  -  a  vector  space  over  F. 

L (V)  -  the  set  of  linear  operators  T  :  V  +  V. 

M  (D)  -  m  x  n  matrices  with  entries  in  D. 
mn 

VD)  -  Mnn<D>’ 

rtA)  -  the  rank  of  ACM  (D). 

mn 

I  a!  -  the  determinant  of  ACM  (D). 

n 

tr(A)  -  the  trace  of  A  c  M  (D). 

n 

DM  (D)  -  the  set  of  all  invertible  matrices  in  M_(D). 
n  n 

A  ~  B  -  A  and  B  are  left  equivalent. 

A  ~  B  -  A  and  B  are  right  equivalent, 
r 

A  ~  B  -  A  and  B  are  equivalent. 

-iii- 
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diagi  A, , . . . ,  A.  ) ,  l  •  A  -  the  direct  sum  of  A1 , 
i-1  1 


A  a  B  -  the  tensor  (Kronecker  product) 
t 


A  -  the  transpose  of  A. 

n  -  totality  of  strictly  increasing  sequences  of  k  integers  chosen  from 

A[a|B)  -  submatrix  of  A  using  row  numbered  a  and  columns  numbered  B. 

I,I(n),  In  -  identity  matrix  of  order  n. 

6,  (A)  -  the  k-th  determinant  invariant  of  A. 
k 

ik ( A)  -  the  k-th  invariant  factor  of  A. 

n(A),n(A,B)  -  the  indices  of  A(x)  and  I  ■  A  -  0  I_  respectively,  ACM  (H„), 

n  m  m  0 

B  e  M  (H  )  • 
n  0 

<  (A),  K  A,B)  -  the  number  of  local  invariant  polynomials  of  degree  p  of  A  and 
P  P 

IR  a  A  -  B  B  ^  respectively. 


r(A,B)  -  the  rank  of  0  A  -  Bt  ft  I„,  A  c  M  (D) ,  B  c  M  (D) . 

n  m'  m  n 

v(A,B)  -  the  nullity  of  I_  ft  A  -  Bfc  0  I  . 

■*  n  m 

A  *  B  -  A  and  B  are  similar. 

A(x)  ~  B(x)  -  A(x)  and  B(x)  are  strictly  equivalent, 
s 

A( x)  ^  B(x)  -  A( x)  and  B( x)  are  strictly  similar. 

A( x)  *  B(x)  -  A( x)  and  B(x)  are  analytically  similar 

A(x)  *  B(x)  -  A(x)  and  B(x)  are  pointwise  similar. 

A( x)  *  B(xl  -  A(x)  and  B(x)  are  rationally  similar, 

deg  p  -  the  degree  of  a  polynomial  p. 

C(p)  -  the  companion  matrix  of  p. 

K(m)  -  the  matrix  < 6 ^ 1 ) j >  “  1»...#m. 

C(  A, B )  -  the  set  of  matrices  X,  AX  =*  XB. 

C(A)  -  the  set  of  matrices  commuting  with  A. 

D(A,p)  -  a  p  neighborhood  of  A. 

ZjjiA)  -  the  components  of  A. 

P( A)  -  the  spectral  radius  of  A. 

o(A),  a  (A)  -  the  spectrum  and  the  distinct  spectrum  of  A. 
a 

a  A  A),  a.  (A)  -  the  peripherial  and  the  distinct  peripheral  spectrum  of  A. 
a  dp 

index  (1)  -  the  index  of  \,  \  c  a(A). 
index  (A)  -  the  index  of  A. 

||a||  -  the  t  norm  of  A. 

R( 1 , A)  -  the  resolvent  of  A. 

F( A0, . . . , Ag-1 )  -  Toeplitz  upper  triangular  matrices. 

F(An,...,A __.)  2  F(Bn, . . . , B_  . )  -  strong  similarity  of  Toeplitz  upper  triangular  matrices 


o(1)  -  quantities  which  tend  to  zero  as  r  *  0 
0(1)  -  quantities  which  are  uniformly  bounded. 
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spectral  theory  of  matrices 

I.  GENERAL  MATRICES 
Shmuel  Friedland 

1 . 1  Rings,  Domains  and  Fields. 

Definition  1.1.1.  A  non-empty  set  s  is  said  to  be  a  ring  if  there  are  defined  two 
operations  addition  and  multiplication  such  that  for  all  a , b , c  in  S 

(1.1.2)  a  +  b  e  S; 

(1.1.3)  a  +  b  =  b  +  a  (the  commutative  law) ; 

(1.1.4)  (a+b)  +  c  =  a  +  (b+c)  (the  associative  law); 

(1.1.5)  there  exists  an  element  0  in  s  such  that  a+0=0+a=a  for  every  a  e  S; 

(1.1.6)  there  exists  an  element  -a  such  that  a  +  (-a)  =  0; 

(1.1.7)  ab  e  S; 

(1.1.8)  a(bc)  =  (able,  (the  associative  law) ; 

(1.1.9)  a(b+c)  =  ab  +  ac,  (b+c)a  =  ba  tea  (the  distributive  laws) ■ 

S  is  said  to  have  an  identity  element  1  if  al  *  la  for  all  a  e  S.  S  is  called 
commutative  if 

(1.1.10)  ab  «  ba,  for  all  a,b  £  S. 

Note  that  the  properties  ( 1 . 1 . 3) - ( 1 . 1 . 9)  imply  that  aO  =  0a  =  0.  It  may  happen  that 

(1.1.11)  ab  =  0 

without  a  or  b  equal  to  0.  In  that  case  we  say  that  a  and  b  are  zero  divisors. 
Definition  1.1.12.  I)  is  called  an  integral  domain  if  r>  is  a  commutative  ring  without  zero 
divisors  containing  identity  1 . 

The  classical  example  of  an  integral  domain  is  Z  -  set  of  integers.  In  what  follows  we 
shall  use  frequently  another  example  of  integral  domains. 

Example  1.1.13.  Let  Cl  c_  In  be  a  set  of  points.  Denote  by  H ( Q)  the  class  of  all  analytic 
functions  f(z^,...,zn>  which  are  analytic  in  the  neighborhood  of  any  point 
C  =  ( C j , . .  • ,  Cn )  •:  ...  21  is  open  then  we  assume  that  f  is  defined  only  in  JJ .  In  case 

that  ..  consists  of  one  point  C  denote  H(.'i)  by  . 


Sponsored  by  the  United  States  Army  under  Contract  Nos.  DAAG29-75-C-0024  and 
DAAG29-80-C-0041. 


The  properties  of  analytic  functions  imply  that  H(.)  is  an  integral  domain  under  ' 
addition  and  multiplication  of  functions,  provided  that  is  connected.  We  shall  alvv.v 

assume  that  ..  is  connected  except  where  otherwise  stated.  The  element  0  is  the  z.  i  - 
function  and  the  identity  is  the  function  f  =  1.  For  properties  of  analytic  function:; 
one  or  several  variables,  consult  for  cxampl.  Rudin  { 1 0"74 )  and  Ounnina-Rossi  (19CS1- 

Let  a,b  Ii  -  We  say  that  a  divides  i.(a  b)  if  b  -  abj ,  bj  .  I)  .  An  element 
called  invertible  if  ajl.  For  an  invertible  a  idiot. •  by  a  ^  the  element  such  tc.at 
(1.1.14)  ..a'1  -  j"la  =  1  . 

In  II ( ..)  the  invertible  elements  are  only  those  run  ti  n:-  wt.i.-h  do  not  vanish  at  ai.v  :t.i 


Definition  1.1.15.  A  field  F  is  an  integral  domain  I.  such  that  ar.v  r.on-Zero  element 


invertible ■ 

The  familiar  examples  of  fields  are  the  set 
bers  1R  ,  and  the  set  of  complex  numbers  C  . 
way  to  obtain  the  field  of  its  quotients.  That 
b  ^  0  such  that 


the  set  of  real 


t  il  ionai r.  I:  there  is  a 


set  of  auct louts  — 


(1.1.16) 


a  c  ad  +  be  ,a.  ac 

b  d  bd  '  b  1  j  'll'  b'“  * 


Thus  the  set  of  the  rational  numbers  Q  is  the  .:uot lent  field  of  the  integers  Z. 
Definition  1.1.17.  Let  »;•  c  £n  .  Denote  by  ’•!  (  )  the  quotient  field  of  H(.  ).  Bv  M. 
denote  the  quotient  field  of  H_.  That  is  M(  .)  is  the  set  of  meromorphic  functions  in 

S 

Definition  1.1. Id.  By  D  (x, ]  denote  the  rinq  of  all  polynomials  in  n  variable 
-  — 1  n  - * - - - * - —  - 

the  coefficients  in  D 


(1.1.19) 


p(x. , . . . ,x  )  =  )  ax 

1  n  :* 


(1.1.20) 


/  .  _  „n  >  t  r  .i  i 

a  =  ( a  , . . . ,  a  e  Z  ,  :  ^  =  )  a  .  ,  x  =  x . 

1  n  +  1  1  . i  1 

1  =  1 
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Thus  S[x]  is  the  ring  of  polynomials  in  one  variable  with  complex  coefficients.  We  say  that 
the  degree  of  p(xi#...,xn)  (deg  p)  is  m  if  there  exists  a^  ^  0  such  that  ! a  I  »  m.  A 
polynomial  p  is  called  homogeneous  if  a^  =  0  for  |a|  <  deg  p.  It  is  a  standard  fact  that 
t)  (x^,...,xn)  is  an  integral  domain  (see  Problems  1.1.22-1.1.23). 

Problems 

(1.1.21) 

(1.1.22) 

(1.1.23) 

(1.1.24) 

(1.1.25) 


(1.1.26) 

(1.1.27) 

i 

I 

I 

:  I 
{  | 


let  C[a,b]  denote  the  set  of  real  valued  continuous  functions  on  the  interval 
[a,b] .  Show  that  C[a,b]  is  a  commutative  ring  with  an  identity  and  zero  divisors. 
Prove  that  B[x]  is  an  integral  domain 

Prove  that  X)[x^>  —  .xj  is  an  integral  domain.  (Use  the  previous  problem  and  the 

identity  I|x. . .  I  »  (B[x,,...,x  ,))  [x  ] 

in  l  n-1  n 

Let  p(xl(...,x  )  €  £{x,,...,x  ].  Show  that 
In  In 

P  *  l  pa  ' 

a<deg  p 

where  either  pq  *  0  or  pa  is  a  homogeneous  polynomial  of  degree  a.  Moreover, 
if  m  =  deg  p  >  1  then  p^  *  0.  The  polynomial  Pm<x1> • • • >xn>  Is  called  the 
principal  part  of  p  and  is  denoted  by  p,,.  (In  case  that  p  is  a  constant  poly¬ 
nomial  pn  =  p.) 

Let  p,q  e  B[x.  , . . .  ,x  ]  .  Show 
l  n 


(pq)n  -  pnqn 


-3- 


1.2  Bezout  Domains 


An  element  d  >  E  is  called  the  greatest  common  divisor  (g.c.d.)  of  a  if 

dja^,  i  =  l,...,n,  and  for  any  d*  such  that  d'|a^,  i  =  l,...,n  we  have  d' |d.  We  denote 

d  =  (a^....,an)  if  g.c.d.  of  a^-.-.a^  exists.  Clearly  (a^,...,a  )  is  unique  up  to  a 

multiple  of  an  invertible  element.  The  elements  a...... a  are  called  co-prime  if 

1  n  - » - 

(a, . a  )  =  1. 

1  n 

Definition  1.2.1.  D  is  called  a  greatest  common  divisor  domain  (GCB  B)  if  any  two  ele¬ 
ments  in  B  have  g.c.d. 

A  trivial  example  of  C>  I  E  B  is  Z. 

Definition  1.2.2.  A  subset  I  £  E  is  called  ideal  if  for  any  a , b  E  1  and  p , q  e  E  the 
element  pa  +  qb  belongs  to  I. 

In  z  any  ideal  is  the  set  of  all  numbers  divisible  by  an  integer  k  /  0.  In  H(ft),  the 
set  of  functions  which  vanish  on  a  prescribed  set  U  £  ft,  i.e. 

(1.2.3)  I(U)  =  (f|f(C)  =  0,  C  E  U,  f  c  H (ft) ) 

form  an  ideal.  An  ideal  I  is  called  prime  if  the  fact  ab  e  I  implies  that  either  a  E  I 
or  b  ■  I.  In  Z  the  prime  ideals  are  the  set  of  all  integers  divided  by  a  certain  prime  p. 

An  ideal  I  is  said  to  be  maximal  if  the  only  ideals  which  contain  I  are  I  and  E  .  I  is 
called  finitely  generated  if  there  exists  k  elements  (generators)  p^,...,p  e  I  such  that 
any  1  I  is  of  the  form 

U.2.4)  1  =  4l'’l  *  a2P2 . kPk 


for  some  a,,..., a,  in  B.  For 

1  k 

such  that 

example,  in  Blx,y) 

the 

set 

of  all  polync 

xnials  p(x,y) 

(1.2.5) 

p(0,0>  *  0 

is  an  ideal  which  is  generated  by 

the  polynomials  x 

and 

y* 

An  ideal  I 

is  called  principal 

ideal  if  it  is  generated  by  one  element  p. 

IX-finition  1.2.6.  E  is  called  a 

Bezout  domain  (B  B) 

if 

any 

two  elements 

a , b  e  E  have  a 

g.c.d.  (a,b>  such  that 

"VT 
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I 

1 


! 


I 


i 


f. 

t 


2  1 


1 

I 


(1.2.7)  (a,b)  «  pa  +  qb 

for  some  p,q  £  E. 

It  is  easy  to  show  by  induction  that  for  a^,...,an  e  BID  one  has 

n 

(1-2-8)  <a! . an>  *  l  Piai  • 

i«l 

We  give  another  characterization  of  BID. 

Lemma  1.2.9.  An  integral  domain  is  a  Bezout  domain  if  and  only  if  any  finitely  generated  ideal 
is  principal. 

Proof.  Assume  that  an  ideal  I  of  BID  is  generated  by  a^,...,a  .  Then  (1.2.8)  implies  that 
(a^,...,an>  e  I.  Clearly  (a^'...,a  )  is  the  generator  of  I.  Assume  now  that  any  finitely 
generated  ideal  of  E  is  principal.  For  a  given  a,b  e  B  let  I  be  generated  by  a  and  b. 

Let  d  be  the  generator  of  I.  So 

(1.2.10)  d  =  pa  qb 

i 

Also  d|a  and  d|b  since  d  generates  I.  Obviously  if  d'|a  and  d'|b  then  (1.2.10) 
implies  that  d'|d.  Thus  d  *  (a,b)  .  So  B  is  BID. 

□ 

Consider  the  ideal  I  £  lD{x,yl  given  by  (1.2.5).  Obviously  (x,y)  •  1.  As  \  (  1,  I 
is  not  principal.  Since  x,y  generate  I  we  showed  that  B[x,y]  is  not  BE.  In  particular 
F[x1( . .  .xnl  are  not  BE  for  n  >_  2.  The  same  arguments  show  that  H(S1)  are  not  BE  in 
case  that  f;  c_  for  n  £  2.  It  is  a  standard  fact  that  F[x]  is  a  Bezout  domain  (e.g.  Lang, 

[1967])  and  we  shall  demonstrate  it  later  on.  For  .1  £  E,  h(S2)  is  BE.  This  result  is  implied 
by  the  following  interpolation  of  open  sets  in  I  (e.g.  Rudin  [1974,  Thrs  15.11,15.15]). 

Theorem  1.2.11.  Suppose  that  0  is  an  open  set  in  C,  A  e  n,  A  has  no  limit  point  in  SI, 
and  to  each  t  e  A  then  are  associated  a  non-negative  integer  m(t)  and  complex  numbers 
W  ,  0  <  n  <  m(t).  Then  there  exists  f  e  H(fl)  such  that 

n.C  -  ~  -  - 

f^n*  (5)  =  n!  w  ,  C  £  A,  0  •;  n  <  m(C) 
n,c  —  — 

Moreover  of  all  w^  _  =  0  then  it  is  possible  to  choose  f  such  that  all  zeros  of  f  are 
in  A  and  f  has  a  zero  of  order  m(C)  at  each  C  c  A. 
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For  given  functions  a,b  e  H(?J)  there  exists  a  function  p  ^  H(')  such  that 

(1.2.13)  c  =  pa  +  b  , 
and 

(1.2.14)  a  =  a^,  b  =  b1c/  a^,^  t  H(2) 

That  is  c  =  (a,b). 

Proof. 

In  case  that  a  or  b  are  zero  functions  choose  p  =  1  and  the  theorem  trivially 
holds. 

Assume  that  ab  ?  0.  Let  A  be  the  set  of  common  zeros  of  a(z)  and  b(z) .  Thus 
at  most  countable.  For  each  4  e  A  let  m(z)  be  the  common  multiplicity  of  zero  in  a 
and  b(z)  at  z  =  z.  Let  f(z)  £  H(ty  whose  only  zeros  4  are  in  A  such  that  1  ha 
multiplicity  m(z).  The  existence  of  such  a  function  implied  by  Theorem  1.2.11.  Put 

a  =  af,  b  =  bf,  a,b  e  H(fi)  . 

Thus  a  and  b  do  not  have  common  zeros.  Let  B  be  the  set  of  zeros  of  a  such  that 
has  the  multiplicity  n(4)  .  According  to  Theorem  1.2.11  there  exists  g  e  H(. )  such  tha 

(1.2.15)  ~  (egU))|  =  b(k,(C)r  k  =  0 . n(;)-l,  4  f  B  . 

dzK  ; 

Since  b(4)  f  0  for  4  e  B.  Put 

(1.2.16)  P  =  (e9-b) |a,  c  =  fe9 

and  (1.2.13)  holds.  So  c|a,  cjb  and  in  view  of  (1.2.13)  c  =  (a,b). 

Corollary  1.2.17.  Let  fl  c_  C.  Then  H(s2)  is  a  Bezout  domain. 


Proof .  Let  a,b  e  H(ft).  By  the  definition  of  H(ft)  there  exists  an  open  domain  7.  £ 

such  that  a,b  e  H<ftQ)  .  (In  case  that  ft  is  open  ftQ  =  ft.)  Consider  the  functions 

p,c  e  H(ftQ)  which  are  constructed  in  Theorem  1.2.12.  Clearly  p,c  e  H(ft)  and  (1 . 2 .13) - (1. 2 . 14) 

implies  that  H(ft)  is  BE. 

G 

Problems 

(1.2.18)  Let  a,b/C  £  BE.  Assume  that  (a,b)  =1  and  (a,c)  =  1.  Shows  that  (a, be)  *  1. 

(1.2.19)  Let  I  be  a  prime  ideal  in  E  .  Show  that  E/I  (that  is  the  set  of  all  cosets  of 
the  form  I  +  a)  is  an  integral  domain. 

(1.2.20)  Let  I  be  an  ideal  in  E.  Denote  by  I(p)  the  following  set 

I(p)  =  Is  |  »  *  bp  +  q,  b  t  E,  q  e  1} 

Show  that  I (p)  is  an  ideal.  Prove  that  I  is  a  maximal  ideal  in  E  if  and  only 
if  for  any  p  t  I,  I(p)  =  E. 

(1.2.21)  Show  that  an  ideal  I  is  maximal  if  and  only  if  E/I  is  a  field. 


1 . 3  oriD,  P  IE  and  BID  domains 


A  non-zero,  non- invertible  element  p  e  D  is  called  irreducible  (prime)  if  the  only  ele¬ 
ments  which  divide  p  are  p  itself  and  the  invertible  elements.  For  example  a  positive 
integer  p  t  Z  is  irreducible  if  and  only  if  p  is  prime.  A  linear  polynomial  is  irreducible 

in  D|x,,...,x  ).  For  c  £  it  is  possible  to  determine  all  irreducible  elements  in  H  ( £1)  . 
In  — 

Lemma  1.3.1.  Let  £  I.  Then  all  the  irreducible  elements  of  H ( £1)  are  of  the  form  z  -  C, 

*,  t  ... 

Proof.  Let  f  i  H(_.)  be  not  invertible.  Then  there  exists  C  £  fi  such  that  f(C)  =  0.  So 

z  -  C  divides  Hz).  Therefore  the  only  irreducible  elements  (up  to  multiplication  by  the 

invertible  elements)  are  z  -  i],  i  e  ,1.  Clearly  — —  is  analytic  in  £1  if  and  only  if 

z-C 

n  =  t,  (  ;  i:  ,:)  .  This  proves  the  lemma. 

□ 

In  particular  if  t  >_  I  then  has  one  prime  element  z  -  t. 

Definition  1.3.2.  D  is  called  unique  factorisation  domain  (BFD)  if  any  non-zero,  non- 
invertible  element  a  can  be  factored  as  a  product  of  irreducible  elements 

(1.3.3)  a  =  P1“‘Pr 

and  those  primes  are  uniquely  determined  up  to  invertible  factors. 

Again  the  ring  of  integers  is  obviously  D  F  ID.  Another  example  of  unique  factorization 
domain  is  Flx^...,*^  for  any  n.  (e.g.  Lang  [1967]). 

Lemma  1.3.4.  Let  .  £  C  be  an  open  set.  Then  H(h)  is  not  unique  factorization  domain. 
Proof.  Let  a(z)  £  H(.'.)  be  a  non-zero  function  which  has  an  infinite  number  of  zeros  in  '.l. 
Such  functions  exist  in  view  of  Theorem  1.2.10.  If  (1.3.3)  was  holding  then  according  to  Lemma 
1.3.1  a(z)  would  have  a  finite  number  of  zeros  which  contradicts  the  choice  of  a. 

n 

A  straightforward  consequence  of  Lemma  1.3.4  that  for  an  open  set  £  (L'n,  ll( '■)  is  not 

HI  IF  I).  See  problem  1.3.17. 

Definition  1.3.3.  I.  i?  called  a  principal  ideal  domain  (I1  lid  if  any  ideal  of  |,  ie 
principal . 


# 


% 


The  standard  examples  of  P ID  are  the  ring  of  integers  and  the  ring  of  polynomials  in 
one  variable  over  a  field.  It  is  known  that  any  PID  is  QFS  (e.g.  Lang  [1967]  or  v.d. 
Waerden  [1959]).  Thus  H(  ’)  for  an  open  set  si  is  not  PID  .  A  very  useful  and  even  more 
restrictive  class  of  PID  is  the  class  of  Euclidean  domains. 

Definition  1.3.6.  D  is  called  Euclidean  domain  (ED)  if  for  every  a  0  there  is  defined 

a  non-negative  integer  d(a)  such  that 

(1.3.7)  for  all  a,b  e  D,  ab  f  0  d(a)  <  d(ab)  ,• 

(1.3.8)  for  any  a,b  c  D,  ab  ^  0,  there  exists  t,r  e  D  such  that 

(1.3.9)  a  =  tb  +  r,  where  either  r  *  0  or  d(r)  <  d(b) 

The  ring  of  integers  is  ED  with 

(1.3.10)  d(a)  =  | a |  . 

The  ring  T[x]  is  ED  with  dtp)  -  the  degree  of  the  polynomial  p(x)  .  It  is  well 
known  that  any  ED  is  PID.  Indeed,  consider  an  ideal  I  c  ED .  Choose  a  £  I  with  the 

minimal  d(a) .  In  view  of  (1.3.9)  a|b  for  any  b  e  I.  Thus  a  generates  I.  This  argument 

show  that  F[x]  is  PID.  We  also  have 

Lemma  1.3.11.  Let  (2  be  a  compact  connected  domain  in  (t.  Then  H(S1)  ijs  ED.  Here  d(a) 
is  the  number  of  zeros  of  a ( z )  in  f!  counted  with  their  multiplicities. 

Proof.  Let  a  j  0.  Then  a(z)  must  have  a  finite  number  of  zeros  in  f!.  Otherwise  there 
will  be  a  sequence  of  zeros  of  a(z)  which  converge  to  some  point  5  e  SI.  Since  a  is 

analytic  in  the  neighborhood  of  ;  a  is  the  zero  function  in  the  neighborhood.  The  con¬ 
nectivity  of  S)  implies  a  5  0  contrary  to  our  assumptions.  Let  Pa(z)  be  a  polynomial  such 
a 

that  —  does  not  vanish  in  D.  By  the  definition  d(a)  =  d(p  ).  Let  a,b  e  H(S1)  ab  i  0. 
Pa  a 


Since  (E[z]  is  ED 


(1.3.13) 


P 

a  =  (— )p  =  (-2-  t)  +  (-S-)r,  d(^)  =  d(r)  . 
Pa  a  pa  b  pa  pa 


Let  a^#a2  e  ED.  Assume  that  d(a^)  ^dfa^).  The  Euclid  algorithm  consists  of  a  sequence- 
{a^}  which  is  defined  recursively  as  follows: 

(1.3.14)  a.  =  t.a.+1  +  a.+2>  «i+2  -  0  or  d(a.+2>  <  d(..+1)  . 

Since  d(a)  >_  0  the  Euclid  algorithm  must  terminate.  That  is 
(1-3'15)  al . ak  *  °'  ak+l  =  °  • 


It  is  a  standard  fact  that 


(1.3.16)  <ai'V  =  ak  ' 

See  Problem  1.3.18.  That  is  the  g.c.d.  of  a^  and  a2  can  be  found  explicitly  in  a  finite 
number  of  steps.  While  for  an  open  set  ft  c  C  the  construction  of  (a,b)  may  involve 
infinite  number  of  steps,  i.e.  limits,  which  appear  in  the  proof  of  Theorem  1.2.11. 

P-oblems 

(1.3.17)  Let  ft  c  cn  be  an  open  set  in  Cn.  Construct  a  function  f  depending  on  one 

variable  in  ft  which  has  an  infinite  number  of  zeros  in  S  3).  Prove  that  f  cannot  be 

decomposed  to  a  finite  product  of  irreducible  elements.  That  is  K(.D  is  not  F  FID. 

(1.3.18)  Consider  the  equality  (1.3.9)  for  r  *  9.  Show  that  *a,t'  =  (b,r).  I'sing  this 
result  prove  (1.3.16). 


« 


Let  F  be  the  field  of  quotients  of  E.  Assume  that  p(x)  t  E[x]  .  Suppose  that 


(1.4.1)  p(x)  ■  P^xIPjtx),  p^x)  e  F[x),  i  »  1,2  . 

Our  problem  is  to  find  conditions  which  yield  that  p^ (x)  e  B[x] ,  i  ■  1,2.  clearly  that  if 

(1.4.1)  holds  then  we  can  multiply  p^fx)  by  a  e  F,  a  y  0,  and  divide  p2(x)  by  a.  So 
we  must  compose  some  normalizations  on  p^  and  p2-  Clearly  for  any  q(x)  e  F[x) 

(1.4.2)  q(x)  -  p(x)/a,  p (x)  E  B[x],  a  e  E 

In  case  that  B  is  a  GCBB  the  decomposition  (1.4.2)  can  be  unique  (up  to  multiplication 
of  invertible  elements  in  E)  . 

Definition  1.4.3.  Let  p(x)  be  a  polynomial  of  degree  m  iii  B[x] 

(1.4.4)  p(x)  »  a.xm+**-+a 

u  m 

The  polynomial  p(x)  is  called  normalized  if  aQ  •  1.  If  B  is  the  greatest  common  divisor 

domain  then  let  c(p)  -  (aQ . am) .  The  polynomial  p(x)  is  said  to  be  primitive  if  c(p)  =  1 

The  following  result  is  obvious. 

Lemma  1.4.5.  Let  F  be  the  field  of  quotients  of  GCBB.  Then  for  any  q(x)  c  F[x)  we 
have  the  decomposition  (1.4.2)  whan  c (p)  and  a  are  co-prime.  The  polynomial  p(x)  is 

uniquely  defined  up  to  an  invertible  factor  in  GCBB.  Moreover  q(x)  decomposes  to 

(1.4.6)  q(x)  »  jr(«),  r(x)  e  GCEBIx)  ,  b,a  e  GCBB 

when  (b,a)  =  1  and  r(x)  is  primitive. 

The  crucial  step  in  proving  that  UFBlx]  is  DFB  is  the  Gauss  lemma. 

Lemma  1.4.7.  Let  p(x),q(x)  c  DTBlx]  be  primitive  then  p(x)q(x)  is  primitive. 

Using  this  lemma  one  easily  gets 

Lemma  1.4.6.  Let  p(x)  c  OFBlx]  be  primitive.  Assume  that  p(x)  is  irreducible  in  F[x)  , 

where  F  is  the  quotient  field  of  DFB.  Then  p(x)  is  irreducible  in  DFBIx]  .  See  Lang 


[1967]  and  Problems  1.4.17-1.4.18. 


Thus  any  p(x)  -  UFFlx)  has  the  unique  decomposition 
(1.4.9)  p(x)  =  a  qj (x) • • *q  (x) 

where  q^ (x) , . . . ,q  (xl  are  primitive  and  irreducible  in  F[x)  and  a  has  the  decomposition 
(1.3.3).  I  fact  (1.4.9)  is  the  decomposition  of  p(x)  to  irreducible  factors  in  F[x]  .  Thus 
we  proved  (e.g.  Lang  [1967]). 

Theorem  1.4.10.  OF3D[x]  is  a  unique  factorization  domain. 

Normalization  1.4.11.  Let  F  be  a  field  and  assume  that  p(x)  is  a  normalized  non-constant 
polynomial  in  F[x).  Let  (1.4.9)  be  a  decomposition  of  p(x)  to  irreducible  factors.  Then 
normalize  the  decomposition  (1.4.9)  by  the  assumption  that  q^ (x) , . . . ,q  (x)  are  normalized 
irreducible  polynomials  in  F[xl.  (This  implies  that  a  =  1.) 

It  is  not  difficult  to  show  that  Lemmas  1.4. 7-1. 4. 8  yield 
Theorem  1.4.12.  Let  p(x)  be  a  normalized  non-constant  polynomial  in  TOF]D[x] .  Let  (1.4.9) 

be  a  normalized  decomposition  of  p(x)  in  F[x],  where  F  is  the  quotient  field  OFB.  Then 
each  q.(x)  is  an  irreducible  polynomial  belonging  to  DFD(x]  . 

See  Problem  1.4.19It  turns  out  that  Theorem  1.4.12  holds  for  any  H(ft),  (2  £  ffn.' 

Theorem  1.4.13.  Let  p(x)  be  a  normalized  non-constant  polynomial  in  H(ft)[x].  Let  (1.4.9) 

be  a  normalized  decomposition  in  M(x],  where  M  is  the  field  of  meromorphic  functions  in  ft. 
Then  each  q^(x)  is  an  irreducible  polynomial  in  H(ft). 

Proof.  By  the  definition  of  H  ( £2 )  we  may  assume  that  p(x)  e  H(ftg)  for  some  open  domain  ft^ 
containing  So  q  (x)  t  M  (q  )  lx)  ,  j  *  l,...,s.  Let 

(1.4.14)  q(x,z)  =  xt  +  £  f-Z-  x*1  r,  ai(z),6i(z)  t  H(ftQ)  ,  i  =  l,...,t  . 

r=l  Bt-r(z)  J 

Thus  q(x,z)  is  analytic  on  -  T  where  T  is  given  by 

t 

(1.4.15)  r  =  (z|z  r  (1  ,  ~fT  S  (z)  «  0}  . 

r  =  l 

is  a  closed  set  of  zero  measure  in  £n.  See  for  example  Gunning-Kossi 


It  is  known  that 


1 • 5  Elementary  Divisor  Domain 

Definition  1.5.1.  D  is  called  elementary  divisor  domain  ( KDD )  if  for  any  three  elements 
a,b,c  e  D  there  exist  p,q,x,y  t  D  such  that 

(1*5.2)  (a,b,c)=(px)a+(py)b+(qy)c 

By  letting  c  =  0  we  obtain  that  (a.b)  is  a  linear  combination  of  a  and  b.  So 
elementary  divisor  domain  is  Bezout  domain. 

Theorem  1.5.3,  Let  D  be  a  principal  ideal  domain.  Then  D  is  elementary  divisor 
domain. 

Proof .  Without  the  loss  of  generality  we  may  assume  that  abc  /  0,  (a,b,c)  =  1 .  Let 
(a,c)  =  d.  Since  D  is  UPD  (e.g.  Lang  [1967])  decompose  a  *  a'a",  where  in  the  prime 
decomposition  (1.3.3)  of  a, a'  contains  all  the  irreducible  elements  of  a  which  appear 
in  the  decomposition  of  d  to  irreducible  factors.  So 

(1*5.4)  a  -  a'a”,  (a', a")  -  1,  (a',c)  -  (a,c),  (a”,c)  -  1  , 

and  if  a',f  are  not  co-prime  then  c,f  are  not  co-prime.  Thus,  there  exists  q  and 
a  such  that 

(1.5.5)  b  -  1  =  -qc  +  aa” 

Let  d'  =  ( a ,b+qc) .  The  above  equality  implies  that  (d’,a")  =  1.  Suppose  that  d’  is 
not  co-prime  with  a'.  So,  there  exists  a  non-invertible  f  such  that  f  divides  d' 
and  a'.  According  to  (1.5.4)  (f,c)  »  f'  and  f'  is  not  invertible.  Thus  f' |b  which 

implies  that  f'  divides  a,c,  and  b.  This  contradicts  our  assumption  (a,b,c)  =  1. 

So  (d ',a')  =  1  which  means  (d',a)  »  1.  Therefore  there  exists  x,y  e  D  such  that  xa  + 


y ( b+qc)  =  1 


This  establishes  (1.5.2)  with  p  “  1. 

Theorem  1.5.6.  Let  fl  £  c.  Then  H(S3)  is  elementary  divisor  domain. 

Proof.  Given  a,b,c,  e  H(fl)  we  may  assume  that  a,b,c,  e  H(nQ)  for  some  °Pen  set 
f  El.  According  to  Theorem  1.2.12 

(1.5.7)  (a,b,c)  =  ( a, (b,c) )  -  xa  +  (b,c)  =  xa  +  (b*yc) 

Problems 

(1.5.8)  D  is  called  adequate  if  for  any  a,c  E  D,  ac  ?  0,  (1.5.4)  holds.  Use  the  proof 
of  Theorem  1.5.6  to  show  that  any  adequate  BD  domain  is  KDD. 

(1.5.9)  Prove  that  H(Q),  0  £  C,  is  an  adequate  domain  (Helmer  [1943]). 


1.6  Algebraically  closed  fields 


Definition  1.6*1.  A  field  F  is  called  algebraical  1  v  closed  if  anv  polynomial 
p(x)  €  F(x]  of  the  form  (1.4.4)  splits  to  linear  factors  in  f 


m 

(1.6.2)  p(x)  =  aA  TT  (x-E.),  f.  eF,i=1,*..,m,  a  ¥  n 

0  .  „  l  i  o 

i=1 

The  classical  example  of  an  algebraically  closed  field  is  tbe  field  o*  the  complex 
numbers  C  •  Tbe  field  of  the  real  numbers  F  is  not  algebraically  closed. 

Definition  1.6.3.  Let  K  and  F  be  fields.  Assume  that  T  c  F.  Then  K  is  called  an 
extension  field  of  F.  K  is  called  a  finite  extension  of  F  if  K  is  a  finite 
dimensional  vector  space  over  F. 

Thus  C  is  a  finite  extension  of  F  of  the  dimension  2.  Tt  is  Vnown  (e.o.  T.ano 
(1967] ). 

Theorem  1.6.4,  Let  p(x)  C  F(x].  Then  there  exists  a  finite  extension  F  r>f  F  such 
that  n(x)  splits  to  linear  factors  in  F[xl. 

The  classical  Weierstrass  preparation  theorem  in  two  complex  variable  is  an  explicit 
example  of  the  above  theorem,  we  state  the  weierstrass  preparation  theorem  in  the  form 
needed  later.  (Pee  for  example  Gunnina-Fossi  (1965].) 

Theorem  1.6. I>et.  Hn  be  the  rina  of  analytic  functions  in  one  variable  analytic  in  the 
neighborhood  of  the  origin.  Let  r(X)  e  F^fXl  he  a  normalized  polynomial  of  degree  n 

n  . 

(1.6.6)  p(X,z)  -  Xn  +  I  a.(z)Xn  ,  a^(z)  C  Hft ,  i=1,...,n 

i=1 

Then  there  exists  a  posit-  ive  integer  s  <  n !  ,  such  that 


*  n  (X-> . (w) ) ,  X . (w)  c  «  ,  i*1 


,n  . 


(1.6.7) 


V< > ,w  ) 


Thus,  in  that  particular  case,  the  extension  field  of  HQ  is  the  set  of  multivalued 
functions  which  are  analytic  in  z^8  in  the  neighborhood  of  the  origin.  Algebraically 
speaking,  K  is  HQ[w)  together  with  the  identity 

(1.6.8)  w*  -  z  . 

The  vector  dimension  of  K  over  F  is  s. 


t 
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We  already  pointed  out  in  Section  1.2  that  for  n  >  2  F(xj,...,xn)  is  not  BD. 

However  F[x^,...>x  ]  has  some  nice  properties.  An  important  property  is  that  any 
F[Xj(...,xn]  is  Noetherian  ring  (e.g.  Lang  [1967)). 

Definition  1.7.1.  D  is  said  to  be  Noetherian  (ND)  if  any  ideal  of  D  is  finitely 
generated. 

In  what  follows  we  shall  assume  that  F  is  an  algebraically  closed.  Let 
Pj,....p  e  F[x^,...(x  ].  Denote  by  u(p^, . . .  tpk)  the  common  set  of  zeros  of  p^-.-.p^. 

(1.7.2)  U(p  ,...,p  )  -  {z|z  -  (x  ,...,x  ),  p  (z)  =  0,  j»1,..,,k) 

*  k  i  n  j 

U(p^,...,pk>  may  be  an  empty  set.  U(pj,...,p^)  is  called  an  algebraic  variety  in  Cn. 

It  is  known  (e.g.  Lang  [1958])  that  any  non-empty  algebraic  variety  U  in  Fn  splits 

k 

(1.7.3)  V  -  U  V  , 

1-1 

where  each  is  an  irreducible  algebraic  variety.  Over  the  complex  numbers  each 

irreducible  algebraic  variety  V  is  a  closed  connected  set  which  almost  everywhere 
(in  V)  is  an  analytic  manifold  in  c"  of  a  fixed  (complex)  dimension  d  which  is  called 
the  dimension  of  V.  If  d  -  0  then  V  consists  of  one  point.  For  any  set  U  _  Fn 
let  T(U)  be  the  ideal  of  polynomials  in  F[x^,...,xn)  vanishing  on  U 

(1.7.4)  I(U)  -  {pip  e  F[x  ,...,x  ),  p(z)  «  0,  z  e  l)} 

»  n 

Consider  an  ideal  I  c_  F[x^,.../X  ).  Suppose  that  the  ideal  generated  by  p^)...,p|(. 
Clearly  I  I  (U<  p1 , . . .  ,pk> ) . 

The  celebrated  Hilbert  Nullstellensatz  gives  the  other  relation  between  the  above  two 


ideals  (e.g.  Lang  [1967)) 
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1.8  Modules 


1*8.1.  Let  S  be  a  ring  with  identity.  An  abelian  group  M  (which  has  an 
operation  +  satisfying  the  conditions  (1.1.2)  -  (1.1.6))  is  called  a  (left)  S-module  if 
for  each  r  e  S,  u  e  M  the  product  ru  is  defined  to  be  an  element  in  M  such  that  the 
following  properties  hold. 

(1.8.2)  r(vl+v2)  -  rv.j+rVj,  (r.,+r2)v  =  ^v+rjV,  (rs)v  =  r(sv),  1v  -  v  . 

A  standard  example  of  a  S-module  is 

(1.8.3)  Sm  =  {v,  v  =  (v  ,...,v  )*,  v.  €  S,  i-1,...,m} 

1  mi 

where 


(1.8.4)  u  +  v  =  (u.+v, , . . . ,u  +v_)t  , 

mm 

(1*8.5)  ru  ■  (ru, , . . . ,ru  )t,  r  e  s  . 

i  m 


M  is  said  to  be  finitely  generated  if  there  exist  n-elements  (generators) 
v  , ...,vn  such  that  any  v  e  M  is  of  the  form 

n  i 

(1.8.6)  v  =  Z  a  v  ,  a  e  S,  i*1,...,n 

i-1  1  1 

If  each  v  can  be  expressed  uniquely  in  the  above  form  then  v1,...,vn  is  called  a 
basis  in  M  and  M  is  said  to  have  a  finite  basis.  N  is  called  a  submodule  if  R  c  m 
and  N  is  a  S-module. 


For  example,  consider  a  linear  homogeneous  system 


(1.8.7) 


n 

l 

j“1 


3iiXi 


0,  aij.xj  £  0,  i-1 


f  TO  ,  e 


Thus  the  set  of  all  x  *  (x,,...^.)6  is  a  submodule  of  Dn.  In  what  follows  we  shall 
assume  that  S  is  an  integral  domain  D.  Let  F  be  a  field.  Then  a  F-module  is  called  a 
vector  space  (V)  over  F.  It  is  a  standard  fact  in  linear  algebra  (e.g.  Gantmacher 
[1959])  that  if  V  is  finitely  generated  then  V  has  a  finite  basis.  In  that  case  V  is 
called  finite  dimensional  vector  space.  The  number  of  vectors  in  any  basis  of  V  is 
called  the  dimension  of  V  and  by  dim  V.  A  submodule  of  V  is  called  a  subspace  of 
P.  Let  H  be  a  D-module  with  a  finite  basis.  Let  F  be  the  quotient  ring  of  O.  It  is 
possible  to  imbed  M  in  a  vector  space  V  over  F  by  considering  all  vectors  v  of  the 
form  (1.8.6)  when  aA  £  F,  i«1,...,n.  Thus  dim  V  -  n.  Using  this  fact  we  get 
Lemma  1.8.8.  Any  two  finite  basis  of  a  P-module  contain  the  same  number  of  elements. 

This  number  is  called  the  dimension  of  M  -  and  denoted  by  dim  N. 

Problems 

(1.8.9)  Let  H  be  a  -  D  module  with  a  finite  basis.  Let  M  be  a  submodule  of  N. 

Prove  that  if  D  is  PID  then  H  has  a  finite  basis. 

(1.8.10)  Let  M  be  a  -  D  module  with  a  finite  basis.  Assume  that  W  is  a  finitely 
generated  submodule  of  N.  Prove  that  if  D  is  BD  then  M  has  a  finite  basis. 


1.9  Matrices  and  homomorphismes 

Notation  1.9.1.  Denote  by  Mmn( D)  the  set  of  all  m  *  n  matrices  A  «  (a^),  where 
a  E  D,  i-1,...,m,  j-1,...,n.  The  set  M^ID)  ie  denoted  by  Mn(D) .  Let  M  and  H 
be  D-modules.  Let  T  :  M  +  N.  T  is  called  a  homomorphism  if 

(1.9.2)  T(au+bv)  “  aTu  +  bTv,  u,v  e  M,  a,b  £  D  , 

for  all  u.v  and  a,b.  As  usual,  let 

(1.9.3)  range  (T)  »  {v|v  «  Tu,  u  £  M,  v  £  N},  her  (T)  »  {u|Tu  *  0,  u  £  «)  , 

be  the  range  and  the  kernel  of  T.  Denote  by  Hom(M.M)  the  set  of  all  homomorphismes  of 

M  to  N.  It  is  a  standard  fact  that  Hom(M.N)  is  a  D-module  by  letting 

(aS+bT)u  “  aSu  +  bTu 


for  any  S.T  £  Hom(M.N),  a.b  £  D,  u  £  M.  Assume  that  M  and  N  have  finite  bases.  Let 

u  ^ ,  •  •  * ,  u1*1  and  v1,...,vn  be  bases  in  M  and  N  respectively.  Then  we  can  set  a 

natural  isomorphism  between  Hom(M.N)  and  Mmn(D) .  Namely,  for  each  T  £  Hom(M.N)  let 

A  “  (a. ,)  £  M  (D)  be  defined  as  follows 
i  j  mn 

i  n 

(1.9.4)  Tu  »  Z  a  v  ,  i=1,...,m  . 

j-1 

Conversely,  for  any  A  £  M  (D)  there  exists  a  unique  T  E  Hom(M,N)  which  satisfies 

mn 

(1.9.4) .  The  matrix  A  is  called  a  representation  of  T  in  bases  u1,...,um  and 

v',...,vn  .  The  rank  of  A  -  denoted  by  r(A)  -  is  defined  as  the  size  of  the  largest 

minor  of  A(|A[a|$]|,  a  £  Q,  8  £  Q,  )  which  do  not  vanish.  Thus  if  A  is  the 


I 


representation  matrix  of  T  e  Hom(M,N)  then  r(A)  -  dim  IN  if  HI  has  a  finite  basis. 

Let  A  e  N  (D).  We  shall  view  A  as  an  element  in  Hom(Dn,t/n)  by  letting  A(x)  -  Ax 
wn 

-  n 

for  x  e  0  . 

We  now  study  the  relations  between  the  representations  of  a  fixed  homomorphism 

T  e  Hom(M,N)  with  respect  to  different  choices  of  bases  in  K  and  N. 

Definition  1.9.5.  A  matrix  U  e  H  (D)  is  called  unlmodular  if  lul 
*  -  "  ■  n  1 

( the  determinant  of  U)  is  an  invertible  element  in  D. 

The  above  definition  is  equivalent  to  the  existence  of  V  e  H  (D)  such  that 

n 

<1.9.6)  UV  -  VU  -  I  , 

when  I  is  the  identity  matrix.  Indeed  |U|  is  invertible  then  U-1  exists  in  the 

division  ring  F.  Moreover  the  standard  formula  for  U_1  in  terms  of  the  minors  of  U 

implies  that  V  -  U  1  6  M  (D).  Vice  versa  if  (1.9.6)  holds  then  |U||V|  •  1  so  |U|  is 

n 

invertible.  Also  U  is  unlmodular  if  and  only  if  the  transpose  of  U  -  ufc  is  unimodular. 
notation  1.9.7.  Denote  by  UMjjtD)  the  set  of  unlmodular  matrices  in  Mn<D). 

Clearly  UM^D)  is  a  multiplicative  group  under  the  ordinary  multiplication  of  the 
matrices.  Unimodular  matrices  appear  naturally  when  we  change  bases  in  D-module. 

Lemma  1.9.8.  Let  M  be  a  D-module  with  a  finite  basis  u1,...,um  .  Then  (u1 , . . .  .u1”} 
la  a  basis  in  M  if  and  only  if  the  matrix  fi  -  (q^)  6  M^ID)  given  by  the  equalities 

„i  m 

(1.9.9)  u  -  I  i-1....,m 

J-1  13 

is  a  unlmodular  matrix. 

Proof.  Suppose  first  that  {u*,...,um}  is  a  basis  in  M.  Then 

(1.9.10)  u 3  -  £  r  uk,  j-1 .... ,m  . 

k-1  3 
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Put  R  =  (r^).  insert  (1.9.1 0)  to  (1.9.8)  to  get  QR  «  I  -  as  {u1,...,u1*}  is  a  basis. 

This  shows  that  Q  is  unimodular.  Assume  now  that  Q  e  UM  (D) .  Let  R  =  A-1.  So 

n 

— m 

(1.9.10)  holds.  Also  u  ,...,u  linearly  independent  over  D,  i.e.  0-cannot  be  written 

~  1 

as  a  non-trivial  combination  of  u  ,...,u  ,  since  otherwise  we  deduce  the  linear 
dependence  of  u',.../un’.  But  this  is  impossible  as  u1,...,um  is  a  basis  in  M.  So 
{u\...,u"}  is  also  a  basis  in  K. 


Definition  1.9.11.  Let  A,B  e  M^tD).  We  say  that  A  is  right  equivalent  to  B(A~B)  if 


(1.9.12) 


for  some  P  c  UM  (D);  A  is  left  equivalent  to  B(A~B)  if 


(1.9.13) 


B  -  QA 


for  some  Q  e  UM  (D);  A  is  equivalent  to  B(A~B)  if 
m  — - 


(1.9.14) 


for  some  0  e  UM  (D),  P  e  UM  (D) 
~ ^ —  m  n 


B  *  QAP  , 


Obviusly,  all  the  above  relations  are  equivalence  relations. 

Theorem  1.9.15.  Let  M  and  N  tie  D-modules  with  finite  bases  having  m  and  n 
elements  respectively.  Then  A,B  t  M^  are  (i)  left  equivalent)  (ii)  right  equivalent i 
(iii)  equivalent;  if  correspondingly  there  exists  T  t  Hom(M,N)  such  that  A  and  B  are 
the  representation  matrices  of  T  in  the  following  bases 


,1  m,  ,  1  n,  .  ,~1  ~m,  ,  1 

(u  ,...,u  I,  iv  ,...,v  }  and  (u  ,...,u  },  Iv  , 


n, 

,v  }  ; 


I 


u 


(ii)’  {u1,...,um}(  {v\...,vn}  and  {v1,...,vn}  > 

(iii)'  {u1 , . . .  .u”*}  ,  {v\...,vn}  and  {u\...,um},  {v\...,vn} 

Proof.  Let  A  be  the  representation  matrix  of  T  in  the  base*  u1,...,um  and 
v1,...^"  .  Assume  that  the  relation  between  the  bases  ^.....u"1  and  {u1,...,u'B}  is 
given  by  (1.9.9).  Then  the  raresentation  matrix  B  in  bases  {u1,...,uB1}  and 

1 

(v  ,...,v  }  is  given  by  (1.9.13).  Indeed 

1,1  a  W#n  ^ 

Tu  -  I  q  TU3  -  [  •  i“1 . .  « 

j-1  3  j,k-1  3  3 

which  proves  (1.9.13).  On  the  other  hand  if  we  change  the  basis  {v\...,vB}  to 

1  ,ijh  i  n  ^ 

{v  ..../V0}  according  to  v3  «  J  p  .v  >  j“1 »...#n,  P  “  (p  p)  e  UM  then  a  similar 

4-1  3*  3*  " 

1  m  »y  ^  ^ 

computation  shows  that  T  is  represented  in  {u  ,...,u  }  and  {v  ,...,v  }  by  AP. 

Combine  the  above  two  results  to  deduce  that  the  representation  matrix  B  of  T  is  bases 
{u1,...,^}  and  (v1,...,vm)  is  given  by  (1.9.14).  The  proof  of  the  theorem  is 
concluded . 

□ 
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1 . 10  Hermlte  normal  form 


The  notions  of  equivalence  of  A,B  e  M  (D)  give  rise  to  the  following  problems. 

mn 

Problem  1.10.1.  Given  A,B  e  Mmn(D).  When  A  and  B  are  (i)  left  equivalent;  (ii)  right 
equivalent;  (iii)  equivalent. 

Problem  1.10.2.  For  a  given  A  e  M^fD)  characterize  the  equivalence  classes 
corresponding  to  A  for  (i)  left  equivalence;  (ii)  right  equivalence;  (iii)  equivalence. 

Clearly  a  satisfying  solution  of  Problem  1.10.2  would  automatically  solve  Problem 
1.10.1.  However,  if  the  solution  of  Problem  1.10.2  is  unknown  or  is  complicated  there  is  a 
point  to  solve  Problem  1.10.1  separately. 

We  first  note  that  for  GCDD  the  equivalence  relations  have  certain  obvious 
invariants. 

Leimna  1.10.3.  For  A  e  M  (GCDD)  let 
- —  —  mn  ■ 

u(a,A)  =  g.c.d.{ | A(a|0] |,  6  E  Q  },  a  E  Q  , 

k  *n  k  »ro 

v(@,A)  *  g«c«da{|A[^|$]|f  ^  e  Q.  }  #  0  e  Q.  , 

k  t  iii  k  t  n 

<1,10'4)  «.(A)  >  g.c.d.{|A[v>|e)|.  *  E  Q  ,  6  E  Q  }  . 

k  k  rm  k  ,n 


(6,  (A)  is  called  the  k-th  determinant  invariant  of  A.)  Then 
k  -  - 


M(a,A) 

=  U (a ,B) , 

Wa  E 

Qk,m  if 

a;b  , 

v(B,A) 

=*  v(B,B) , 

VB  E 

2k.n  if 

AJB  , 

VA) 

•  6  (B) 
k 

if 

ArB  , 

k~1,...,  min(m,n). 
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Proof .  Suppose  that  (1.9.12)  holds.  The  Cauchy-Binet  formula  (e.g.  Gantmacher  [1959)) 


implies  |B[a,Y)l  »  £  |A(o,8)|  |P[8,Y)|.  So  u(a,A)  divides  u(a,B).  As  A=  BP-1 
CeSk,n 

v(a,B) |u(a,A) .  Thus  y(a,A)  *  w(a,B).  (Recall  that  u(a,A)  and  u(a,B)  are  determined 
up  to  invertible  elements).  The  other  equalities  in  (1.10.5)  are  established  in  a  similar 
way. 


Note  that 


(1.10.6) 


AXB 


t  t 
A  ~B  , 


A,B  CM  (D)  . 

mn 


Thus  it  is  enough  to  consider  the  left  equivalence  relation.  In  what  follows  we 

characterize  the  equivalence  class  for  left  (right)  equivalence  relation  in  case  that  D 

is  a  Bezout  domain.  To  do  so  we  need  a  few  notations. 

Recall  that  P  e  M  (D)  is  called  a  permutation  matrix  if  P  is  a  matrix  having  at 
n 

each  row  and  column  one  non-zero  element  which  is  the  identity  element  1 .  Clearly 

P  £  OM  (D)  since  P'1  -Pfc. 
n 

Definition  1.10.7.  A  unlmodular  matrix  U  e  is  called  simple  if  there  exist 

permutation  matrices  P,(J  such  that 

(1.10.8)  V  »  P(V  •  In_2 )Q  , 

where  V  is  a  unlmodular  2*2  matrix 

a  8 

(1.10.9)  V  -  (Y  5)  £  UM2(D)  , 
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(1.10.10) 


V 


a, S  -  invertible 


U  is  said 


Definition  1.10.11.  Let  A  e  Mmn(D).  The  following  row  (column)  operations  are  called 
elementary 

( i )  interchange  any  two  rows  (columns)  of  A; 

( i i )  multiply  row  (column)  i  by  an  invertible  element  a ; 

(iii)  add  to  row  (column)  j  b  times  row  (column)  i(i^j). 

The  following  row  (column)  operation  is  called  simple. 

(iv)  replace  row  (column)  i  b^  a  times  row  (column)  i  plus  b  times  row  (column) 
j ;  and  row  (column?  j  by  c  times  row  (column)  i  plus  d  times  row  (column)  j ; 
where  ad-bc  is  an  invertible  element  in  D. 

It  is  not  difficult  to  see  that  elementary  row  (column)  operations  can  be  carried  out 
by  multiplication  of  A  by  a  suitable  elementary  matrix  U  from  left  (right),  and  the 
simple  row  (column)  operations  are  carried  out  by  multiplication  of  A  by  a  simple 
matrix  U  from  left  (right). 

Theorem  1.10.12.  Let  D  be  a  Bezout  domain.  Consider  ASM  (D).  Assume  that 
-  —  ■  mn 

r(A)  -  r.  Then  there  exists  B  =  (b.  .)  e  M  (D)  which  is  left  equivalent  to  A  and 
■  i j  mn  - 

satisfies 


(1.10.13) 


bij  «  0  for  i  >  r  ; 


if 


in. 


is  the  first  non-zero  entry  in 


jth 


row  then 


(1.10.14) 


1 


< 


<  n2  <•••< 


n  <  n 
r 
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The  numbers  n3,...,nr  are  uniquely  determined  and  the  elements  bin  ,  1-1,..,, r  are 


uniquely  determined,  apart  from  arbitrary  Invertible  factors,  by  the  condltioas 


v((n. 


(1.10.15) 


<ni ) , A)  -  b^-.b^,  i-1 . .  , 


v(a,A)  =0,  a  e  Q  ,  1-1, 

l ,  ( n  -i ) 


The  elements  b 


jn^  (j  <  i)  are  then  successively  uniquely  determined  apart  from  the 


addition  of  arbitrary  multiples  of 

V*  The  remaining  elements  b^  are  now  uniquely 
determined.  Moreover,  the  unlmodular  matrix  Q  which  satisfied  (1.9.13)  can  be  given  as  a 
product  of  finite  number  of  simple  matrices. 

Proof .  Our  proof  is  by  induction  on  n  and  m.  For  n  =  m  -  1  the  theorem  is 
obvious.  Let  n  »  1  and  assume  that  for  a  given  m  >  1  there  exists  a  matrix  Q  which 
is  a  finite  product  of  simple  matrices  such  that  the  entries  (1,1)  of  QA  are  equal  to 
zero  for  i  -  2,...,m.  Let  A,)  e  and  assume  that  A  »  (ei1),  i“1,...,m. 

Put  C,  -  C  *  I,.  Then  A2  =  and  the  (i,1)  entries  of  a2  are  zero  for 


1-2,..., m.  Interchange  the  second  and  the  last  row  of  a2  to  obtain  Aj.  Clearly, 


(3) 


A3  -  (ai1  )  «  Q2a2  where  Q2  is  an  appropriate  permutation  matrix.  Let 


A4  »  (aj^ ' .“21* )C *  since  D  was  assumed  to  be  Bezout  domain,  there  exists  a,B  e  D  such 


that 


(1.10.16) 


(3)  +  «J3>  -  ,  <3>  t3)^ 

“11  +  Sa21  -  (*11  'a21  >  =  3 


As  (a, 6)  =  1  there  exists  f ,S  c  D  such  that 


(1.10.17) 


06  -  By  =  1  . 


Let  V  be  a  2x2'  unimodular  matrix  given  by  (1.10.9).  But  A  =  VA  =  (.,)■ 
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According  to  Lemma  1.10.3  v((1),A4'  -  v( ( 1 ) , Ag ) .  As  v((1),A  )  =  d  we  must  have 

d|d’.  That  is  d'  =  pd.  Thus 

(o’  =  WV  ”  =  (-p  1*  E  UM2<D>  ’ 


Let 


23  =  '«  •  Vl»v  *  W  • 

Then  the  last  m  rows  of  A,  =  (a!^)  =  Q,A.  are  zero.  So  a,6'  =  v((1),A,)  «*  v((1),A) 

o  1 1  3  3  T  o 

and  the  theorem  is  proved  in  this  case. 

Assume  now  that  we  proved  the  theorem  for  all  A.  e  M  (D)  where  n  <  p.  Let 

1  mn 

n  =  p+1  and  asm,  ..(D).  Call  A.  «  (a,.,),  i“1,...,m,  j-1,.,.,p.  The  induction 
m(  p+ 1 )  1  x j 

assumption  implies  the  existence  of  Q1  c  UM^tD)  which  is  a  product  of  simple  matrices 


such 


that  B1  *  (b^1)  =  21*1  satisfies  the  assumptions  of  the  theorem.  Let  n^,...,n^ 


ij 


be  the  integers  defined  by  A.p  If  b 


M) 

in 


0  for  i  >  s  then  n^  «  n^,  i«1,...,s  and 
(1) 


B  =•  Q 1 A  is  in  the  right  form.  Suppose  now  that  bin  /  0  at  least  for  some  i, 

s  <  i  <  m.  Let  B  «  (b!1*),  i=s+1,...,m.  According  to  what  we  proved,  there  exists 
2  in 

53,  e  UM  (D)  such  that  53, B,  =  (c,0 , .. . ,0 )b ,  where  c  is  the  g.c.d.  of 
2  m-s  £  * 

b!**  .  b*1*.  So  Bn  “  (I_  «  Q,)B,  is  in  the  right  form,  where  s  «  r  -  1, 

( s+1 )m, . . . ,  mn  3  s  '2  1 


i=l . .  nr 


Next  we  prove  (1.10.15).  First  if  a  c  Q.  ,  then 

i , ( ni~1 ) 


any  matrix  B[B|a),  B  e  Q,  has  at  least  one  zero  row,  so  |B[Bla]|  ■  0.  Thus 

l  »nr 

v(a,B)  =  0.  Lemma  1.10.3  yields  that  v(a,A)  ■  0.  Let  a  *  (^.....n^).  Then  B(Bla) 
has  at  least  one  zero  row  unless  B  equals  to  Y  ■  (1,2,...,!).  Therefore  v(a,A)  = 


v(a,B)  “  |B[yfa]|  *  b4  •••b  /  0.  This  establishes  (1.10.15).  So  n,,...,n,  are 

in^  in.  •  r 

determined  by  (1.10.15).  It  is  obvious  that  b1n^ >  • • • >brn  are  determined  up  invertible 


elements.  Suppose  that  n  =  nr*  Then  we  can  add  to 


*th 


row  (j  <  r)  any  multiple  of 


,th 


row  without  changing  the  firBt  n  -  1  columns  of  B  while  the  last  n  -  r  rows  of 

( 
i 


B  remain  zero  rows.  This  argument  shows  that  b^n  (j  <  i)  can  be  only  determined  up  to 
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the  addition  of  multiples  of 

the  above  freedom  for  j«1,. 

determined.  Let  C  e  M  (D) 
mn 

and 


b^n^.  It  is  left  to  prove  that  once  bjn^  *re  chosen  within 
.1.  i”1,...,r,  then  all  the  remaining  elements  are  uniquely 
such  that  the  n1 , . . . ,nr  columns  of  C  and  B  are  the  same 


c^  «  0  for  n^  <  j  <  ni+1,  i  <  k,  i-0,...,r  (ng  ■  -1  if  n,  >  1,  and 


nr+^  ■  n  if  nr  <  n)  . 


Assume  that 


C  -  OB,  U  e  M  (D)  . 

in 


We  claim  that 


(1.10.18) 


“ij 


4^,  i«1,...,m, 


j»1.. 


.  ,r 


We  prove  this  result  by  induction  on  n.  For  n  »  1  it  is  obvious.  Suppose  that  the 

assertion  holds  for  n  -  p.  Let  n  -  p+1.  Put  B  •  C  ^ ) ,  C  «  (c^),  i»1,...,m, 

j"1...  p.  So  C  ■  UB.  If  nf  <  p+1  then  the  induction  hypothesis  implies  (1.10.18). 

Assume  that  n  «  nr-  Then  (1.10.18)  holds  for  j>1,...,p.  The  equality  C  “  UB 

implies  c._  =  u..b„  .  As  b__  »  c_„  M  0  we  have  u..  -  1 .  Now 

r  rnr  rr  rnf  rnr  rn^  rr 


'  in„ 


“in. 


airbrn_ 


i*1,...,r-1,  c 


in. 


uirbrn. 


i“r+1 , . . .  ,m 


By  our  assumption  cin  -  bin  for  i  <  r-1  so  the  above  equalities  yield  uir  *  0  for 
i  »  r-1.  Also  the  assumption  that  cin  -  0  for  i  >  r  implies  uir 


0.  Since  the 


last  m  -  r  rows  of  C  and  B  are  equal  to  zero  we  finally  deduce  that  C  *  B.  This 


establishes  the  uniqueness  of 
as  above. 


B  provided  that  the  elements 


b3"i' 


J-i . 1 


were  chosen 


□ 


A  matrix  B  c  M 

mn 

conditions  (1.10.13)  - 
normalizations. 
Normalization  1.10.19. 
0  for  j  <  i. 

Theorem  1.10.20.  Let 


(D)  is  said  to  be  in  the  Hermite  normal  form  if  it  satisfies  the 
(1.10.14).  In  what  follows  we  shall  always  assume  the  specified 


If  bj„  is  an  invertible  element  we  choose 
—  ini  - 


3in< 


1  and 


bin 


U  be  a  unimodular  matrix  over  a  Bezout  domain.  Then  U  is  a 


i 


finite  product  of  simple  matrices. 

Proof.  Since  |u|  is  invertible  according  to  Theorem  1.10.12  each  b^  is  an  invertible 
element.  Thus  the  Normalization  1.10.19  implies  that  Hermite  normal  form  of  U  is  the 
identity  matrix.  Thus  the  inverse  of  U  is  a  finite  product  of  simple  matrices. 

Therefore  U  itself  is  a  finite  product  of  simple  matrices. 


Normalization  1.10.21.  For  Euclidean  domains  assume 


(1.10.22) 


either  bjn 


d<bjni>  <  d(bini> 


for 


j  <  i  . 


For  F[x]  we  choose  b^n  tc  be  a  normalized  polynomial. 

Combining  Normalization  1.10.21  with  Theorem  1.10.12  we  get 

Corollary  1.10.23.  Over  the  ring  F[x]  the  Hermite  normal  form  of  ASM  (Pfxl )  is 

""  "  1  mn  ““ 

unique  provided  that  the  Normalization  1.10.21  holds. 

It  is  a  well  known  fact  that  over  Euclidean  domains  Hermite  normal  form  can  be 
achieved  by  performing  elementary  rows  operations.  This  result  follows  by  considering 
2x2  matrices. 

Theorem  1.10.24.  Let  A  c  M^IBD).  Then  A  can  be  brought  to  its  Hermite  normal  form  by  a 


Proof.  Suppose  that 


Let  us  compute  ai+2  bv  (1.3.14).  Po  At  is  eouivalent  to 


i  ■(' 

1  V 


*1+2  bl+2' 

“i+1  bi+1y 


9  *  a  ~  +  a  .  b  =  h  -  t  . 

1+2  i  1  1+1 '  1+2  1  1  1+1' 


*i+2  "  "  °r  ',,*l+21  «  "'Vi* 


Thus  is  left  eouivalent  to  Aj+i.  As  the  Fucllfl  aloorlthm  terminates  after  a  fini+e 

number  of  steps  we  obtain  that  ak+1  ”  0.  If  av  ¥  0,  »v  i®  i°  Hermite  normal 
form.  Otherwise,  a^  =  a2  >  n  anA  we  perform  the  fucli*  aloorithm  on  hi,  M  to  obtain 
the  Hermite  form. 


Corollary  1.10.25.  Let  C  e  1TM?(TO).  Then  tt  is  a  finite  product  of  elementary 
unlmodular  matrices. 

Corollary  1.10.2A.  Let  IT  c  I’M  (TO) .  fben  n  is  a  finite  nrouct  of  elementary  unimorfular 
matrices. 

Problems 

(1.10.27)  Let  T  c  Pom(M,F)  where  |M  and  |N  are  |Bh  modules.  Assume  that,  m  =  WV" . 

Let  Im(T)  he  the  imaoe  of  H  in  W.  then  the  module  Iml*")  has  a  basis  tu 1 , . . . , n'uv 
such  that 


(1.10,2°) 


i  4  1 


1-1 


Ci1V  *  Cil  *  1-1 . V  * 


where  v^,...,vn’  is  a  permutation  o*  the  stan'ler*  basis  (A  .....A  )*,  1«1,.,.,m  in 

1 1  1  w 


(1.10.29)  Let  ACM  (BD)  and  assume  the  B  is  the  Hermite  normal  form.  Let  n,  <  j  < 
inn  1 

n,...  Prove  that  for  a  =  (n.,...,n.  j),  via, A)  =  b,  ...b,,  4,  b. .  (n.  *  0). 

1  1  i  (i— ij  0 


1.11  Systems  of  linear  equations  over  Bezout  domains 

Consider  a  system  of  m  linear  equations  in  n  unknowns. 


n 

(1*11,1)  aijXj'  “ij'bi  E  D'  1-1 . .  j“1 . "  - 


In  a  matrix  notation  (1.11.1)  is  equivalent  to 

(1.11.2)  Ax  =  b,  ACM  (D),  X  C  M  ,  (D)  ,  b  £  M  (D)  . 

mn  n 1  ml 

Let 

(1.11.3)  A  =  ( A,b)  C  M  ,  (D)  . 

m ( n+ 1 J 

The  matrix  A  is  called  the  coefficient  matrix  and  the  matrix  A  is  called  the  augmented 
coefficient  matrix.  In  case  that  D  is  a  field  the  classical  Kronecker-Capelli  theorem 
states  (e.g.  Gantmacher  [1959])  that  (1.11.1)  is  solvable  if  and  only  if 

(1.11.4)  r( A)  =  r ( A)  . 

Let  f  be  the  quotient  field  of  D.  Thus  if  (1.11.1)  is  solvable  over  D  it  is  also 
solvable  over  T.  Therefore  (1.11.4)  is  a  necessary  condition  for  the  solvability  of 
(1.11.1)  over  D.  Clearly,  even  in  the  case  where  m  «  n  “  1  this  condition  is  not 


sufficient 


In  this  section  we  give  necessary  and  sufficient  condition  on  A  for  the  solvability 
of  (1.11.1)  over  D  in  case  that  0  is  a  Bezout  domain.  To  do  so  we  need  the  following 
lemma. 


Lemma  1.11.S.  Let  A  e  M  (BD),  A  f  0. 

Then 

there  exists  an  m  *  m 

permutation  matrix 

P  and  an  n  *  n  unimodular  matrix  U 

such 

that 

(1.11.6)  C  -  (ci;()  -  PAU,  -  0 

for 

)  >  i,  cii  y  0,  i-1,... 

*  ,r  , 

cij  “  0 

for 

j  >  r,  r  -  r(  A) 

Proof.  Consider  the  matrix  Afc.  By  interchanging  the  columns  of  A1",  l.e.  multiplying 
A*"  from  right  by  some  permutation  matrix  P4,  we  can  assume  that  in  the  Hermite  normal 
form  of  AtPt,  nj^  »  1,  i»1,...,r.  This  establishes  (1.11.6) 

Theorem  1.11.7.  Consider  the  system  (1.11.1).  _lf  d  Is  a  Bezout  domain.  Then  (1.11.1) 
is  solvable  if  and  only  if 

(1.11.8)  r  -  r( A)  -  r(A),  &*( A)  -  6^ (A)  . 

Proof.  Assume  first  the  existence  of  x  e  M  .(D)  which  satisfies  1.11.2.  A3  we  pointed 

out  already  this  assumption  implies  the  equality  (1.11.4).  Also  from  (1.11.2)  we  deduce 

that  b  is  a  linear  combination  of  the  columns  of  A.  Consider  any  r  x  r  minor  of  A 

which  contains  the  column  b.  Since  b  is  a  linear  combination  of  the  other  columns  of 

A  we  deduce  that  this  minor  is  a  linear  combination  of  r  x  r  minors  of  A.  So  6  (A) 

r 

divides  the  minor  in  question.  In  view  of  the  definition  (1.10.4)  of  6^(A)  clearly 
6  (A)|i  (A).  Thus  we  proved  that  the  condition  (1.11.8)  is  necessary  for  the  solution  of 
(1.11.1)  over  Bezout  domain  (in  fact  over  GCDD).  Suppose  now  that  the  condition  (1.11,8) 
holds.  By  changing  the  order  of  the  equations  in  (1.11.1)  and  considering  a  new  set  of 


variables 


(1.11.9) 


y  -  (J  x,  (1  e  um  (D)  , 

n 


we  may  assume  that  A  =  C  where  C  is  given  in  Lemma  1.11.5.  In  view  of  (1.11.6)  and  the 
condition  (1.11.4)  the  last  m  -  r  are  linear  combinations  of  the  first  r  equations 
(possibly  the  coefficients  of  the  linear  combinations  are  in  the  quotient  field  of  D). 
Therefore  it  is  enough  to  show  the  solvability  of  the  system 


(1.11.10) 


y  c  x  =  b.  ,  i=  1 , . . . ,m,  c  ”/0  I  i—  1 ..... n 
j=1  13  ]  1 


Let  m  =  1.  Clearly,  in  this  case  4^(C)  =  c^  and  4^(0  *  (c  ,b  ).  The  second 
equality  in  (1.11.8)  means  =nlb1  ao  (1.11.10)  is  solvable  over  D.  Assume  that  (1.11.8) 
holds  for  the  system  (1.11.10)  (r=m).  Consider  an  m  x  m  minor  of  C  composed  from 
the  m  rows  and  the  columns  2,...,m+1.  This  minor  is  equal  to  (-1 )m_,bic22* • •cmm‘  since 

this  minor  is  divided  by  6^(0  =  we  have  that  c,  divides  b1 .  So 

x.j  »  b^/c^  e  D.  Thus  it  is  left  to  show  that  the  system 


(1.11.11) 


cijxj  *  bi  '  ciibi/cn'  1-2 . ™ 


is  solvable  over  D.  Put  C  »  (c„),  j=2,...,m,  b  •  (b^  -  cilb11/c11), 

i*2,...,m.  The  induction  hypothesis  would  imply  the  solvability  of  (1.11.11)  if 

4  .(C)  =  4  .(C).  That  is,  it  is  enough  to  show  that  c„„...c  divides  any 

m-1  m-1  22  mm  ’ 

(m-1)  »  m-1)  minor  |J|  of  C  which  is  composed  of  2,...,m  rows  and  2,...,i-1, 
i+1,...,m+1  columns  of  C.  Consider  the  minor  |J|  of  C  which  is  composed  of  the 
rows  1,...,m  and  the  columns  1,2,..,  ,i-1,  i+1,...,m+1.  Substract  from  the  last  column 


,^3L _ 


the  first  column  times  bl/ci1.  $0  |j|  =  c^lJl.  pjnc*  6JC)  -  6^(C)  w  h«ve  that 

c11***c|nlc11l^l  .  Therefore  5^^  (F)  j  I2TI  which  finally  implies  that  (1.11.11)  is  solvahl# 
over  D.  This  completes  the  proof  of  the  theorem. 


Theorem  1.11.12.  Let  A  e  M  (BD).  Then  ranoe  (A)  and  Ver  (A)  are  modules  in  fiFi 
~  —  mn  — — ~  -  ■  ■  ■ 

and  Dn  having  finite  bases  with  r(A)  and  n-r(A)  elements  respectively.  **oreover(  the 
base  of  ker  ( A )  can  he  completed  to  the  base  In  Dn . 

Proof .  As  in  the  proof  of  Theorem  1.11.7  we  mav  assume  that  A  =  f  where  c  is  given  hv 

(1.11.6).  Let  e*  =  (6  ,...,6  1^,  clearlv  Ae\.,.,Aer  is  a  basis  in  ranoe  (A)  and 

i  1  in 

er+1/...,en  is  a  basis  for  ker  (A). 


Let  a  e  f*mn(GCOD).  lr  we  expand  anv  o  x  o  minor  of  A  hv  anv  o  -  o 
1  <  p  <  q,  we  immediately  deduce  that 


(1.11.13) 


6  ( A )  1 6  (A)  for  1  4  o  <  ci  4  mintm,n)  . 
P  cr 


Definition  1.11.14.  Por  Ae  (GCDD)  denote  i i ( A) , . . . , ir( A)  (r=r(A))  the  invariant 
factors  of  A. 


(1.11.15) 


i^(A)  =  «;.(A)/5-i_1(A),  1=1 , . . . ,  r(  A) ,  (5n  (A)  =  1)  . 


Then  invariant  factor  1^(A)  is  called  trivial  if  i^(A)  is  invertible. 
From  (1.11.14)  and  (1.11.17)  we  deduce 


(1.11.16) 


i-i_l(A)li.,(A)r  1=7 , . . . ,r( A)  . 


Suppose  that  (1.11.1)  is  solvable  over  D.  T’sino  the  sect  that  h  is  a  linear  combination 
of  the  columns  of  A  and  Theorem  1.11.7  we  get  a  weaker  version  of  Theorem  1.11.7. 
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Corollary  1.11.17.  Let  A  e  Mnn( BD) •  Than  the  aygtaw  (1.11.1)  la  solvable  over  BD  if_ 
and  only  If 


(1.11.18) 


r(A)  =  r(  A) ,  lk(A)  =  ik(A),  k-1,...,r 


Problems 


(1.11.19)  Let  A  e  Mn(GCDD).  Prove  that 

(1.11.20)  I A|  -  i1(A)...in(A). 
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1.12  Smith  normal  form 

According  to  Lemma  1.10.3  and  Definition  1.11.14  for  A  c  M^tGCDD)  the  rank  of 
A  -  r  and  the  Invariant  factors  ij  ( A) , . . . , ir( A)  are  the  invariants  with  respect  to  the 
equivalence  relation.  It  turns  out  the  if  D  is  elementary  divisor  domain  then  the  above 
invariants  characterize  the  equivalence  class  of  A  with  respect  to  the  relation  A  ~  B. 
Theorem  1.12.1.  Let  0  f  A  e  Mnn(D).  Assume  that  D  is  elementary  divisor  domain.  The 
A  is  equivalent  to  a  diagonal  matrix  of  the  form 


(1.12.2) 


B  -  diag(i1{A),...,ir(A),  0,...,0) 


where  r  is  the  rank  of  A. 

Proof.  Recall  that  if  D  is  EX)  then  D  is  BD.  For  n  -  1  the  Hermite  normal  form 
of  A  is  a  diagonal  matrix  where  i  (A)  *  6  (A).  So  the  theorem  is  established  in  this 
case.  Next  we  show  that  the  theorem  holds  for  m  •  n  -  2 .  We  may  assume  that  A  is  in 
the  Hermite  normal  form 


A  - 


According 
Clearly  (p,q) 


to  the  Definition  1.5.1  there  exist  p,q,x,y  c  D 
"  (x,y)  "1.  So  there  exist  p,q,  x,y  such  that 


which  satisfy 


(1.5.2). 


Let 


pp  -  qq  -  xx  -  yy  -  1. 
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Then  (1.5.2)  and  the  above  equalities  imply 


G 


VAU  * 


VA)  gi2\  # 

^  g21  g22  / 


Since  6^(0  =  S^A)  we  deduce  that  i^A)  divides  g12  and  g21.  By  applying 
appropriate  elementary  row  and  column  operations  we  deduce  that  A  is  equivalent  to  a 
diagonal  matrix 


C  =  diag( i, ( A) ,d2 ) 


Since  6^(0)  =  i^fAjd^  *  6^ (A)  we  obtain  that  C  is  in  the  form  (1.12.2).  We  now 
prove  the  case  m  >  3,  n  =  2  by  the  induction  on  m.  Let  A  «  (a^),  1=1 » • • • »m-1 * 
j=1,2.  We  can  assume  that  A  is  in  the  form  (1.12.2).  In  particular  a1f  is  the  g.c.d. 
of  all  elements  of  A.  Interchange  the  second  row  with  the  last  one  to  obtain  A^ .  Apply 
simple  row  and  column  operations  on  the  first  two  rows  and  columns  of  A1  to  obtain 
A^  “  (aj^*)»  where  aj^*  =  i^A).  Now  use  the  elementary  row  and  column  operations  to 
obtain  A3  of  the  form 


(1.12.3) 


A3  =  i,  (A)  •  A4 


Since  A4  has  one  column  we  bring  it  to  a  diagonal  form.  So  A  is  equivalent  to  a  matrix 


diag(i1(A),  i1(AJ,)f  0, 


,0) 


We  claim  that  C  is  in  the  form  (1.12.2).  Indeed 
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6,(0  =  i  ( A)  i  ( A  )  =  6, (A) 

2  1  i  4  2 


So 


6  (A)  6  (A) 

i  ( A  )  “  -  ■  =  •  ■■■ 

1  V  i 1 ( A)  6 1 ( A) 


i2<  A)  . 


For  n  >  3  we  prove  the  theorem  by  the  induction.  Thus  we  may  assume  that  A  *=  (a  ), 


1-1 . 


,m,  is  already  in  the  form  (1.12.2).  So  a,,  is  the  g.c.d..of  all  the 


elements  of  A.  Interchange  the  second  column  with  the  last  column  in  A  to  obtain 

A^  «  (ajV).  Then  A  is  equivalent  to  a  matrix  A^  *>  (a^')  such  that  the  first  two 

columns  of  Aj  form  the  canonical  form  of  A^.  This  in  particular  implies  that 
(2 ) 

a1 1  =  i ^ ( A ) .  Perform  elementary  row  and  column  operations  to  bring  A2  to  a  matrix 

Aj  of  the  form  (1.12.3).  As  i^Aj)  =  <  A)  we  obtain  that  i,(A)  divides  all  entries 

of  A4.  The  induction  hypothesis  implies  that  A  is  equivalent  to  a  diagonal  matrix 


C  —  diag( i1(A),i1(A4),...,ir_1(A4>,0,...,0)  . 

It  is  left  to  show  that  C  is  the  matrix  (1.12.2).  Indeed,  as  i, ( A) I i..( A4)  and 
ij( A4 ) I i ( A4 )  we  immediately  deduce  that 


6k(C)  -  i1(A)i1(A4).*«ik_1(A4)  . 


SO 


1,(0  =  i,  ( A) ,  ik(C)  -  ik_1(A4),  k=2,...,r 


-<J  1  - 


This  shows  that  A  Is  equivalent  to  the  matrix  B  given  by  (1.12.2).  The  proof  of  the 
theorem  is  completed. 

The  matrix  (1.12.2)  is  called  Smith  normal  form  of  A. 

Corollary  1.12.4.  Let  A,B  €  Mm[1(KDD).  Then  A  and  B  are  equivalent  if  and  only  if 
A  and  B  have  the  same  rank  and  the  same  invariant  factors. 

Over  elementary  divisor  domain  the  system  (1.11.2)  is  equivalent  to  very  simple  system 

(1.12.5)  i^lAly^  =  Cy,  k=1 , . . . ,r( A) ,  0  =  ck,  k=1,...,m  , 

(1.12.6)  y  =  P_1x,  c  =  Qb 

where  P  and  Q  are  the  unimodular  matrices  appearing  in  (1.9.14)  and  B  is  of  the  form 
(1.12.2).  For  the  system  (1.12.5)  Theorems  1.11.7  and  1.11.12  are  quite  obvious.  We  also 
have 

Theorem  1.12.7.  Let  A  e  M^fEDD).  Assume  that  all  the  invariant  factors  of  A  are 
trivial.  The  the  basis  of  range  (A)  can  be  completed  to  a  basis  of  Dn. 

In  what  follows  we  adopt 

Normalization  (1.12.8).  Let  ACM  (rlxl).  Then  the  invariant  polynomials  (the  invariant 
—  —  —  -  —  mn  “  ■ 

factors)  of  A(x)  are  assumed  to  be  normalized  polynomials* 

Problems 

(1.12.9)  Let  A  =  diag(p,q)  e  M2(BD).  Then  A  is  equivalent  to  diaq((p,q),  ) . 

(1.12.10)  Let  AeM  (D),  B  e  M  (D)  and  assume  that  D  is  GCDD.  Suppose  that 

mn  pq 

either  is<A)Ut(B)  or  it(B)|is(A)  for  s”1,...,r(A)  »  a,  t*1,...,r(B)  -  6.  Show  the 

set  of  the  invariant  factors  of  ABB  is  {i,(A),...,i  (A),  i  (B) , . . . , 1  (B) } . 

1  a  1  8 

(1.12.11)  Let  *£M  be  D  modules  with  finite  bases.  Assume  that  D  is  EDD.  Prove 

that  there  exists  a  basis  u1>....un  in  H  such  that  . . • . . irur  is  a  basis  in  N 

when  ij(«..'i  c  D  and  j=1 , . . . , r-1. 
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(1.12.12)  Let  M  be  a  D  module  and  , Nj  £  K  be  submodules.  N.,  and  Nj  are  called 

equivalent  if  there  exists  T  £  Hom(M,M) ,  T  isomorphism  (i.e.  T  1  £  Hom(M,M) )  such  that 

TN.  =  N,.  Suppose  that  M,  N.  and  N.  have  bases  (u ...... u  },  (v  ,...,v  }  and 

•  *  1  *  1  n  i  m 

(w  ,...,w  }  respectively.  Let 
1  m 


n  n 

(1.12.13)  v  »  T  a.  . u.  ,  w.  =  I  b,.u.,i=1,...,m,  A*(a..),  B=(b  ) 
i  js=1  ij  3  i  j=1  13  3  13  13 


Show  that  and  Nj  are  equivalent  if  and  only  if  A  ~  B. 

(1.12.14)  Let  H  £  M  be  D  modules  with  bases.  Assume  that  N  has  a  division 
property.  That  is  if  ax  £  N  for  O^aeD,  x  £  M  then  x  E  N.  Show  that  if  0  is 
■DD  and  N  has  a  division  property  then  any  basis  in  N  can  be  completed  to  a  basis  in 

N. 
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1.13  Applications  to  the  ring  of  local  anayltlc  functions  In  one  variable 

In  this  section  we  consider  the  applications  of  the  Smith  normal  form  to  the  system  of 

linear  equations  over  the  ring  of  local  analytic  functions  in  one  variable  in  the 

neighborhood  of  the  origin.  According  to  (1.1.13)  this  ring  is  denoted  by  Hq.  In  Section 

1.3  we  showed  that  the  only  irreducible  element  in  Hn  is  z.  Let  A  e  M  (H„).  Then 

0  mn  0 

A  «  A( z )  and  A(z)  has  the  McLaurin  expansion 


{  1.  13.  1  )  A(z)  =  l  A.  zk 

k=0  K 

which  converges  in  some  disc  |z|  <  R(A).  Here  R(A)  is  a  positive  number  which  depends 

on  A.  That  is  each  entry  a^(z)  of  A  has  convergent  McLaurin  series  at  least  for 

I Z |  <  R( A)  . 

notations  and  Definitions.  Let  A  e  M (H^).  Then  the  local  invariant  polynomials 
(the  invariant  factors)  of  A  are  normalized  to  be 

(1.13.31  ik(A)  -  z  ,  0  <  i1«...<ir,  r  *  r(A>  . 

The  number  lr  is  called  the  index  of  A  and  13  denoted  by  n  »  n(A).  For  a  non¬ 
negative  p  denote  by  »  *c^(A)  -  the  number  oflocal  invariant  polynomials  of  A  whose 

degree  equals  to  n . 

We  start  with  the  following  perturbation  result. 

Lemma  1.13.4.  Let  A , B  e  M  { H  ^  )  •  Consider  the  matrix 
-  -  mn  0  - - - —  — 

(1.13.5)  C(z)  -  A(z>  +  zk+1B(z)  , 
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where  k  is  a  non-negative  integer.  Then  A  and  B  have  the  same  local  Invariant 
polynomials  up  to  the  degree  k.  Moreover  if  k  is  equal  to  the  Index  of  A,  and  A 
and  B  have  the  same  ranks  then  A  is  equivalent  to  B. 

Proof ■  Without  restriction  in  generality  we  may  assume  that  A  is  in  the  diagonal  form 


(1.13.6) 


A(z)  =  diagfz  r,0 


,0)  . 


Let  s  =  £  k^(A).  For  t  <  s  any  t  *  t  minor  of  C(z)  =  (ci^(z))  which  does  not 


j»0 


ij' 


contain  the  first  t  diagonal  elements  c1 1  ( z) , . . .  ,ctt  (  z)  is  divisible  at  least  by 


i 1+* • i  +  k+1 


On  the  other  hand  the  minor  of  C(z)  which  is  composed  of  the 


i  +• • *i 

1  t 

first  t  rows  and  columns  of  C(z)  is  of  the  form  z  (1+z  0(z)). 


So 


(1.13.7) 


6t(C)  =  6t(A)'  t  =  1’",s  • 


This  proves 


(1.13.8) 


it(C>  =  it(A),  t  -  1,...s  . 


As 


Y  k  ,  ( A)  and  i  -  k  it  follows 
^  1  S 
j-1  3 


<  (C)  =  K  .  ( A)  ,  j  <  k,  K  (A)  <  K  (C)  . 

J  j  K  K 


Interchange  the  roles  of  A  and 


C  to  deduce 


TX'l'  " 


(1.13.9)  K^fC)  -  k^(A),  j  »  0 , . . .  ,k  . 

This  shows  that  A(z)  and  C(z)  have  the  same  local  invariant  polynomials  up  to  the 
degree  k.  Suppose  that  r(A)  =  r(C).  Then  (1.13.8)  implies  that  A(z)  and  C(z)  have 
the  same  invariant  polynomials.  That  proves  that  A  ~  C. 

D 

Consider  a  system  of  linear  equations  over  Hq. 

(1.13.10)  A(z)u  =  b(z),  A(z)  £  M  (H  ) ,  b(z)  E  M  (H„)  , 

mn  0  ml  0 

where  we  look  for  a  solution  u(z)  e  M  (H_ ) .  According  to  Theorem  1.11.7  the  system 

Ml  0 

(1.13.10)  is  solvable  if  and  only  if  r(A)  =  r(A)  and  the  g.c.d.  of  all  r  *  r  minors 

of  A  and  A  are  the  same.  In  theory  of  analytic  functions  it  is  common  to  try  to  solve 

(1.3.10)  by  the  method  of  power  series.  That  is  assume  that  A(z)  has  an  expansion 
(1.13.1)  and  b(z)  has  an  expansion 


(1.13.11) 


b(  z) 


l  b<k,zk. 
k*0 


Then  one  looks  for  a  formal  solution 


(1.13.12) 


u(z)  =  l 
k=0 


(k)  k 
u  z  , 


(k) 


M  (C)  ,  k  =■  0, 1 , ... , 
mn 


which  satisfies 
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(1.13.13) 
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In  particular  if  n  -  n(A)  then  for  any  w(0)  £  there  exists  w(x)  £  M^IHg)  such 
that 

(1.13.18)  A(x)w(x)  =  0,  w( 0 )  =  w<0)  . 

Proof .  We  first  establish  the  theorem  when  A(z)  is  in  the  Smith  normal  form  (1.13.6). 

In  that  case  the  system  (1.13.13)  reduces  to 

(VI  (v\ 

(1.13. 19 ) u  -  b  if  i  <  k,  0  -  b  if  either  i  >  k  or  s  >  r(A)  . 

s  s  s  s  s 

i- 

The  the  above  equations  are  solvable  for  k  «  0,...,q  if  and  only  if  z  divides 

bs(z)  for  all  i  <  q  and  for  is  >  q  zq+1  divides  bg(z).  If  i  <  q  then  substract 
^  ,  is  8  . 

from  the  last  column  of  A  the  s  -  the  column  times  b  (z)|z  .  So  A  is  equivalent  to 

the  matrix 


Al(z) 


diag( z 


,z  ) 


•  z*1*1  ^(z) , 


j-0 


K^A), 


6  l,(i-l)(n+1-l)<H0l 


According  to  Problem  (1.12.10)  the  local  invariant  polynomials  of  A1(z)  whose  degree  does 

A1  h 

not  exceed  q  are  z  ,...,z  •  So  A<z)  and  A^z)  have  the  same  local  invariant 
polynomials  up  to  degree  q.  Thus  we  proved  that  (1.13.19)  is  solvable  for  k  “  0,...,q 
if  and  only  if  A(z)  and  A(z)  have  the  same  local  invariant  polynomials  up  to  degree 
q.  Assume  next  that  (1.13.10)  is  solvable.  Since  A(z)  is  of  the  form  (1.13.6)  the 
general  solution  of  (1.13.10)  in  that  case  is 


i, 

Uj(z)  =  b^ ( z ) /z  j=1,..., 


r(A),  Uj(z)  -  arbitrary  for  j  »  r(A)  +  1 


n 


So 


<i,) 

Uj(0)  =  b  J  ,  . . r(A),  ( 0 )  -  arbitrary  for  j-r(A)  +  1,...,n  . 

Clearly  (1.13.19)  implies  *  a  (0)  for  k  -  i  .  The  solvability  of  (1.3.10)  implies 

8  8  S 

that  b  ( z )  -  0  for  s  >  r(A).  So  u^  is  not  determined  from  (1.13.19)  for  s  > 
s  s 

r(A).  This  proves  the  existence  of  u(z)  satisfying  (1.13.10)  such  that  u(0)  -  u*03. 

Consider  the  homogeneous  system  corresponding  to  (1.13.19)  for  k  *=  0,...,q.  So  u*0>  «  0 

for  ig  <  q  and  otherwise  u^° *  is  a  free  variable.  This  verifies  (1.13.17).  Finally  as 

the  homogeneous  system  (1.13.18)  is  solvable  then  for  q  =  n(A)  if  w*°*  e  W  ,  that  is  we 

q 

have  a  solution  of  the  homogeneous  system  corresponding  to  (1.13.19)  of  the  form 

w* , . . . ,w*^* ,  then  as  we  proved  above  (1.13.18)  follows. 

It  is  left  to  show  that  the  general  case  can  be  reduced  to  the  special  one  discussed 

above.  According  to  Theorem  1.12.1  there  exist  matrices  P  e  M  (H„),  Q  e  M  (H_ )  such 

n  0  m  0 

that 


i  i  * 

Q(z)A(z)P(z)  “B(z)  «  diagl z  ' , . . . ,z  r,  0,...,0),  r  »  r(A),  P(z)  »  £  Pvzk, 

k-0 

OB 

0(*>  -  I  QkZk.  |P0I  *  0,  |Qg|  *  0. 

k=0 


Introduce  a  new  set  of  variables 


v(z>,  v(0),  v'” .  u(z)  -  P(z)v(z),  u(k)  =  l  P  v<j),  k  -  0,1,...  . 

j-0  K"3 

) P q )  0  we  can  express  v(z)  and  v*®3,  v*33,...,  in  terms  of  u(z)  and 

u(1> .  correspondingly.  Thus  (1.13.10)  and  (1.13.13)  is  equivalent  to 


B(z)v(z)  «  c(z),  c(z)  -  8(z)b(z),  £  B  «  c^\  k  •  0,1,  ...,q. 

j-0  3 


Since 


,(0) 
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Now  the  theorem  follows  since  A  ~  B  and  A  ~  B  ■  (B,c)  as  B  «  S2A(P  ®  I1)» 

□ 

Problems 

(1.13.20)  Consider  the  system  (1.13.10).  This  system  is  said  to  be  solvable  in  the 
punctured  disc  if  the  system. 

(1.13.21)  A( 2q ) u( Zq )  -  b(zQ) 

is  solvable  for  any  point  0  <  |zQ|  <  R  (as  a  linear  system  over  C)  for  some  positive 
R.  Prove  that  (1.13.10)  is  solvable  in  the  punctured  disc  if  and  only  if 

(1.13.22)  r(  A<  z) )  «  r(A(z>)  . 

That  is  (1.13.10)  is  solvable  over  the  quotient  field  Mq. 

(1.13.23)  Consider  the  system  (1.13.10).  This  system  is  said  to  be  pointwise  solvable  if 
the  system  (1.13.21)  is  solvable  for  all  |zq|  <  R  for  some  positive  R.  Prove  that 

(1.13.10)  is  pointwise  solvable  if  and  only  if  in  addition  to  (1.13.22)  the  equality 

(1.13.24)  r(  A(0) )  -  r(  A(0) ) 

holds. 

(1.13.25)  Let  A(  z )  e  M  (H  ) .  A(z)  is  call  generic  if  whenever  the  system  (1.13.10)  is 

win  o 

pointwise  solvable  then  it  is  analytically  solvable  (i.e.  there  exists  u(z)  e  Mn1(H0) 
such  that  (1.13.10)  holds).  Prove  that  A(z)  is  generic  if  and  only  if  n(A)  <  1. 

(1.13.26)  Let  A(z!  e  M  <H(S))),  b(z)  e  M  (H(£2)),  0  c  C.  Consider  the  equation 

win  in  i 

(1.13.10) .  Show  that  (1.13.10)  has  a  solution  u(z)  e  M  ,(K(0))  if  and  only  if  for  any 

ni 

5  e  0  the  equation  (1.13.10)  is  solvable  over  H  ,  i.e.  there  exists  u(c)  e  M  (H  ) 

4  me 

which  satisfies  (1.13.10).  (Use  the  fact  that  H(ft)  is  BOO  so  as  in  Section  1.12  one 
may  assume  that  A  Is  in  Smith  normal  form.) 

(1.13.27)  Let  A(z)  and  b(z)  satisfy  the  assumptions  of  Problem  (1.13.26).  A(z)  is 
called  generic  if  whenever  the  system  (1.13.10)  is  pointwise  solvable,  i.e.  1.13.21  is 


m.'  ii  ■  .■  - j 


solvable  for  any  z.  e  f!,  then  there  exists  a  solution  u(z)  e  M  (H(S))).  Prove  that 
0  nl 

A (z)  is  generic  if  and  only  if  the  invariant  functions  (factors)  of  A(z)  have  only 

simple  zeros.  ( t  is  called  a  simple  zero  of  f  e  H(ft)  if  f(C)  ■  0  and  f'(C)  ¥  0). 

(1.13.28)  Let  ACM  (H(ft)),  ft  c  C.  Prove  that  all  the  invariant  factors  of  A  are 
mn  — 


trivial  if  and  only  if 


(1.13.29) 


r(A(OJ  -  r(  A)  for  all  ?  c  ft 


(1.13.30)  Let  ACM  (H(ft)),  ft  c  C.  Assume  that  (1.13.29)  holds.  Using  Theorem  1.12.7 
mn 

prove  the  existence  of  n  vectors  x1,...,xm  e  M  (H(ft)),  such  that 

ml 

|  x1  (5) , . . .  ,xm(C )  |  f<  0  for  all  5  c  0  and  HCp  =  [x1,...,xr],  r  »  r(A). 
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Definition  1.14.1.  A  matrix  A(  x)  €  M  (Dfx))  is  called  pencil  if 
-  -  mn  - - 


(1.14.2)  A(x)  -A  +  xA.,  A  , A  e  M  (D)  . 

0  101  mn 


A  pencil  A(x)  is  called  regular  if 


(1.14.3) 


A(  x)  e  M  (D[x)  ),  |A(x)  |  ?  0  . 

Tl 


Otherwise  the  pencil  is  called  singular.  Two  pencils  A(x),B(x)  e  ^^(Dfx)  )  are  called 
strictly  equivalent  if 


(1.14.4)  B(x)  *  QA(x)P,  P  e  UM  (D) ,  Q  E  OH  (D)  . 

n  m 

We  denote  this  relation  by  A(x)  ~  B(x). 

- ' -  8 

The  classical  works  of  Weierstrass  (1867]  and  Kronecker  [1890],  see  also  Gantmacher 
[1959] ,  classify  the  equivalence  classes  of  pencils  under  the  strict  equivalence  relation 
in  case  that  D  is  a  field  F. 

We  now  give  a  short  account  of  their  main  results.  First  we  note  that  if 
A(x)  ~  B(x)  then  A( x)  ~  B(x)  over  the  domain  D[x] .  In  fact  we  have  little  more.  Put 

(1.14.5)  B(x)  =  B0  +  XB1  . 

Then  the  condition  (1.14.4)  is  equivalent  to 

(1.14.6)  B  -  QA„P,  B,  -  QA  P ,  P  E  UM  (D),  Q  E  UM  (D)  . 

o  o  i  i  n  hi 


So  we  can  interchange  A^  with  A1  and  B0  with  without  affecting  the  strict 

equivalence  relation.  Thus  it  is  natural  to  consider  a  homogeneous  pencil 


(1.14.7)  A(  Xq  i  )  -  Xq  Af,  *■  x^A^  . 

Suppose  that  D  is  a  unique  factorization  domain.  So  (e.g.  tang  [1967])  D[Xg,x,]  is 
UFD  which  implies  that  DJxg.x^  is  GCDD.  So  we  can  define  the  invariant  determinants 
^k^X0'X1*  and  the  invariant  factors  i^jxg.x^,  k  =  1,...,r(A),  for  the  homogeneous 
pencil  A(x0,x.j). 

Lemma  1.14.8.  Let  ACxg.x^)  be  a  homogeneous  pencil  over  WFDfXQ.x^.  Then  the  Invariant 
determinants  ^*xo'xi'  and  the  invariants  factors  i^tXQ.x^,  k  =  1,...,r(A)  are 
homogeneous  polynomials.  Moreover,  if  (5^(x)  and  i^fx)  are  the  Invariant  determinants 
and  factors  of  the  pencil  A(x),  k  »  1,...,r(A),  then 

(1.14.9)  6,  (x)  -  6  ( 1 ,x)  ,  i(x)  =  1(1, x),  k  =  1 , .. . ,r( A)  . 

k  k  k  k 

Proof.  Clearly  any  k  x  k  minor  of  AtXg.x,)  is  either  zero  or  a  homogeneous  polynomial 
of  degree  k.  Thus,  in  view  of  Problem  1.14.24  we  deduce  that  the  g.c.d.  of  all  non¬ 
vanishing  k  x  k  minors  is  a  homogeneous  polynomial  6^(x(),x,)).  As  i^fxg.x^  = 

6  (x  ,x  i/6  (x  ,x.)  Problem  1.14.24  implies  that  i.  (x„,x,)  is  a  homogeneous 
k  0  1  k-1  01  x  u  i 

polynomial.  Consider  the  pencil  A(x)  which  is  given  in  terms  of  the  homogeneous  pencil 
A(x0,x,)  as 

(1.14.10)  A(x)  -  A(  1  ,x)  . 

So  6  (x)  -  the  g.c.d.  of  k  x  k  minors  of  A(x)  is  obviously  divisible  by  6  (1,x).  On 
k  k 

the  other  hand  we  have  the  following  relation  between  the  minors  of  Alx^x^)  and  A(x) 

k  X1 

(1.14.11)  A(  x  ,x  )  [a  IS]  ”  x  A(  —  )[a|B],  a, 8  €  Q 

u  i  u  xrt  K»n 
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1 

This  shows  that  Xg  d^(— )  (p^  •  deg  6  ^  ( x ) )  divides  any  k  *  k  minor  of  Afxg.x^).  So 
pk  X1  I  X° 

x  5^  (— )  |  6^  (  xQ  ,x^ ) .  This  proves  the  first  part  of  (1.14.9).  So 


(1.14.12) 


5k(X0'X1*  =  X0  (x0  Bk 


deg  5^  (  x)  ,  >  0 . 


Now  the  equality 


vw 


VVV^k-i'W 


implies 

| o  X 

<1.14. ’3)  ik(X0'X1)  “  X0,'tX0,'V7!')]  ’  °k  *  deg  VX)'  ♦k  *  °  ’ 

This  establishes  the  lemma. 

We  call  6){(Xg,x1)  and  ik(xg,x1)  the  invariant  homogeneous  determinants  and  the 
invariant  homogeneous  polynomials  ( fac  ors)  respectively. 

The  classical  result  due  to  Weierstrass  [1867]  states: 

Theorem  1.14.14.  Let  A( x)  e  M^JFIx])  be  a  regular  pencil.  Then  a  pencil  B(x)  is 
strictly  equivalent  to  A(x)  if  and  only  if  A(x)  and  B(x>  have  the  same  invariant 
homogeneous  polynomials. 

Proof.  The  necessary  part  of  the  theorem  holds  for  any  A( x) ,  B(x)  which  are  strictly 
equivalent.  Suppose  now  that  A(x)  and  B(x)  have  the  same  invariant  homogeneous 
polynomials.  According  to  (1.14.9)  the  pencils  At x)  and  B(x)  have  the  same  invariant 
polynomials.  So  A(x)  ~  B(x)  over  F(x] .  Therefore 

(1.14.15)  W(x)B(x)  =  A(x)U(x),  U(x),W(x)  e  M  (F[x)), 

n 

lu(x)|  «  Const  /  0,  |W(x) |  =  Const  f  0 
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■Wmht1£M^W«»*plirf»li»**^  SM.I. 


,V'^'iWWtf*WeL:  'A'-- 


Assume  now  that  A^  and  B1  are  non-singular.  Then  (see  Problem  1.14.25)  it  is  possible 
to  divide  W(x)  by  A(x)  from  right  and  0(x)  by  B(x)  from  left 

(1.14.16)  W(x)  =  A(  X )  W ,  (  x )  +  R,  U(x)  =  U1  (  x)  B(  x)  +  P  , 

where  P  and  R  are  constant  matrices.  So 

A(x)(W,(x)  -  U,(x))B(x)  -  A(x)P  -  RB(x)  . 

As  |  A-|  t  |B1 1  ?  0  we  must  have  W^fx)  »  U^(x)  otherwise  the  left-hand  side  of  the  above 
equality  would  be  of  degree  at  least  2  (see  Definition  1.14.19)  while  the  right-hand  side 
of  this  equality  is  at  most  of  degree  1.  So 

(1.14.17)  W1(x)U1(x),  RB(x)  ■  A(x)P  . 

It  is  left  to  show  that  P  and  R  are  non-singular.  As  W(x)  is  unimodular  there 

exists  v(x)  e  UM  (F[xM  such  that  I  -  W(x)U(x). 
n 

Let 

V(x)  -  B( x)V^ ( x)  +  S  . 

So 

X  -  (A(x)W,(x)  +  R)V(x)  -  A(  x)W^  (  x)V  ( x)  +  RV  ( x)  = 

-  A(x)W,(x)V(x)  +  RB(x)V,(x]  *  RS  » 

-  A(x)W,(x)V(x)  +  A(x)PV,(x)  +  RS  « 

-  A(x)  tW,(x)V(x)  +  PV,(x)]  +  RS  , 

where  we  used  the  second  equality  in  (1.14.17).  Since  I  A, I  *  0  the  above  equality 


implies 


W,  (  x)V ( x)  +  PV ^  (  x )  =0,  RS  =  I 


So  R  is  invertible.  The  same  arguments  show  that  P  is  invertible.  Thus  A(x)  and 
B(x)  are  strictly  equivalent  if  |A1B1I  y  0. 

Consider  now  the  general  case.  Introduce  a  new  variables  yQ,y., 

yfl  =  axfl  +  bx1  ,  y,  =  cxQ  +  dx1  ,  ad  -  cb  j  0  . 


Then 


AfyQ.y,)  =  y0A0'  +  y^,*,  B(y0,yi)  =  y^o'  +  y^,'  . 

Clearly  A(y0,y^)  and  B(y0,y.j)  have  the  same  invariant  homogeneous  polynomials.  Also 

A<y0<y.,>  ~  Bfy^y^)  if  and  only  if  A(xQ,x^)  ~  B(x(),x1).  Since  Atxp.x,)  and 

B(xq(x^)  are  regular  pencils  it  is  possible  to  choose  a,b,c,d  such  that  A1 '  and  ' 

are  non-singular.  This  shows  that  Aty^.y^  ~  Bty^y^l  according  to  the  previous  case. 

So  A(  x)  ~  B(x) . 
s 

In  fact,  we  also  proved 

Corollary  1.14.18.  Let  A(x),B(x)  E  M  (F[x]).  Assume  that  A1  and  B ^  are  non- 
singular.  Then  the  pencils  A(x)  and  B(x)  are  strictly  equivalent  if  and  only  if 
A(x)  and  B(x)  are  equivalent. 

For  singular  pencils  the  invariant  homogeneous  polynomials  alone  do  not  determine  the 

class  of  strictly  equivalent  pencils  as  in  the  case  of  regular  pencils. 

We  now  introduce  the  notion  of  column  and  row  indices  for  A(x)  e  M^tFlx)).  Consider 

the  system  (1.13.18).  The  set  of  all  solutions  w( x)  is  a  F[x]-module  N  with  a  finite 

basis  w, ( x) , . . . ,w_( x) .  (Theorem  1.11.12). 

I  s 

To  specify  a  choice  of  basis  we  need  the  following  definition. 


Definition  1.14.19.  Let  A  £  M  (Dfx . x  11.  So 

“  -  mn  1  k  — 


A(x . x  )  =  l  A  xa,  A  £  M  (D) 

1  k  i  i  ^  «  ci  a  mn 


!  at  |  <  <3 


(1.14.20) 


a, 

a  1  k 


k  r 

a  =>  (a  , . . .  ,cl  )  c  Z  ,  |a|  -  l  a  .  x~  »  x  '•••x. 

i-1  i  it 


Then  the  degree  of  A(x1#...,xk>  (deg  A)  _is  d  if  there  exists  Aq  f  0  with  |a|  -  d. 

Definition  1.14.21.  Let  A  e  M  (F[x] )  and  consider  the  module  M  c  F(xln  of  all 
"  -  mn - —  —  -■  —  — 

solutions  of  (1.13.18).  Choose  a  basis  w1 ( x) , . . .ws( x) ,  s  -  n  -  r(A)  ^n  M  such  that 
wk<x)  £  M  has  the  lowest  degree  among  w(x)  £  M  which  are  linearly  independent  (over  the 
quotient  field  of  F[x] )  of  w1 (x) , . . . .w^., ( x)  for  k  »  1,...,s.  Then  the  column  indices 
«1  <  a2  of  A(x)  are  given  as 

(1.14.22)  ak  “  de9  '  *  “  !»•••#*  • 

The  row  indices  0  <  8^<...<8  .  t  =  m-r(A),  of  A(x)  are  the  column  indices  of  the 
transposed  matrix  AC ( x ) . 

It  can  be  shown  (e.g.  Gantmacher  (1959) )  that  the  column  (row)  indices  are  independent 
of  a  particular  choice  of  a  basis  w1 ( x) , . . . ,wg( x) .  We  now  state  the  Kronecker  result 
[1890]. 

Theorem  1.14.23.  The  pencils  A(x),B(x)  £  M^tFlx]  )  are  strictly  equivalent  if  and  only 
if  they  have  the  same  Invariant  homogeneous  polynomials  and  the  same  row  and  column 
indices. 

See  for  example  Gantmacher  [1959!  for  a  proof  of  this  theorem. 

Problems 

(1.14.24)  Using  the  fact  UFDtx^ , ... ,x  ]  is  UFD  and  the  equality  (1.1.26)  show  that  if 
a  £  OFD[ x1 , . . . , xn!  Is  a  homogeneous  polynomials  then  in  the  decomposition  (1.3.3)  each 
is  a  homogeneous  polynomial. 
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(1.14.25)  Let 


(1.14.26) 


W(  x) 


k=0 


W.  x 
k 


U(x) 


U,  x 


k=0 


Assume  that  A(x)  is  a  pencil  (1.14.2)  such  that  A^  is  a  square  non-singular  matrix. 
Show  that  if  p,q  >  1  then 


W(x)  =  A(  x)  A 


tw  xq-1l 
q 


+  W( x) /  U(x)  = 


[U  xP"^ ] a”1 A( x)  +  U ( x) 
P  1 


where 


deg  W(x)  <  q,  deg  U(x)  <  p 

Prove  the  equalities  { 1.14. 16)  where  R  and  P  are  constant  matrices.  Suppose  that 
Aj  *  I.  Show  that  R  and  P  in  (1.14.16)  can  be  given  as 


(1.14.27) 


j<-VkV  p  *  jnV-\ 

k=0  k=0 


(1.14.28)  Let  A(x)  be  a  regular  pencil  such  that  |A^|  f  0.  Prove  that  in  (1.14.12)  and 

(1.14.13)  k  =  (Use  the  equality  (1.11.20)  for  A(x)  and  A(Xq,x^). 

(1.14.29)  Consider  the  following  two  pencils 


2  +  x 

1+x 

3+2x  \ 

/  2+x 

1  1 

1+x 

1+x 

3  +  x 

2+x 

5+2x 

J,  B(x)  =  j  1+x 

2+x 

1+x 

3+x 

2+x 

6  +  3x  j 

\  1+x 

1+x 

1  +  x 

over  R(x].  Show  that  A(  x)  and  B(x)  are  equivalent  but  not  strictly  equivalent. 
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(1.14.30)  Let 


A(x) 


f  vk 

k=0  ' 


e  M  (C[xl  )  . 

mn 


Put 


q 

V 

k=0 


,  q-k 
A,  x„  x 
k  0 


k 

1 


where  q  =  0  If  A(x)  =  0  and  Aa  ^  0,  Ak  =  0 ,  q  <  k  <  p  if  A(x)  ¥  0.  Let  ik(xn,x,), 
k  =  1,...,r(A)  be  the  invariant  factors  of  A(Xq,Xj).  Prove  that  i.  (Xg.Xj)  is  a 
homogeneous  polynomial,  k  =  1,...,r(A).  Show  that  ik(1,x),  k  =  1,...,r(A),  are  the 
invariant  factors  of  A(x). 

(1.14.31)  Let  A(x),B(x)  e  M  (C[xJ).  A(  x )  and  P(x)  are  called  strictlv  equivalent 

mn 

(A  ~  B)  if  B(x)  =  PA(x)Q,  P  e  M  (C)  ,  Q  e  M  (C)  .  (  1 P I  1 0 1  *  0).  Prove  that  if 
s  m  n 

A  B  then  the  A(x_,x.)  and  B(x  -x  1 
s  0  1  0  1 

have  the  same  invariant  factors. 

(1.14.32)  Prove  that  the  pencils  A(x)  and  B(x)  are  strictly  equi valent  if  and  onlv  t f 

At(x)  ~  Bt(x). 
s 
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1 . 1  5  Sim ilarj tv  of  matrices 


Definition  1.15.1.  Let  A,S  E  mhe  matrices  A  and  a  are  called  similar 

(A  n)  if 


(1.15.2) 


9  =  OAO-1  , 


for  some  0  £  DM  (D). 

-  m 

Clearlv  the  similaritv  relation  is  an  equivalence  relation.  So  Mm(n)  is  divided  to 
equivalence  classes  which  are  called  the  similaritv  classes.  It  is  a  standard  fact  that 
each  similarity  class  corresponds  to  all  possible  representations  of  some  T  £  Wom(M,M) , 
where  M  is  a  D-module  havino  a  basis  of  n  elements.  Indeed,  let  u1,...,um  be  a 
basis  in  M.  Then  T  is  represented  by  A  =  (a  )  £  m  (|») 

1  j  Tfl 

.  TT*  . 

(1.15.3)  Tu1  =  T  a  V,  i  =  1,...,m  . 

1-1  13 

Let  u1,...,um  be  another  basis  in  M.  Assume  that  O  is  a  unimodular  matrix  which  is 
oiven  bv  (1.9.9).  Then  accordino  to  (1.15.3)  and  the  arouments  of  Section  1.°,  the 
representation  of  t  in  the  hasis  u1,...,ul"  is  oiven  bv  the  matrix  9  of  the  t0rm 
(1.15.2). 

The  similaritv  notion  of  matrices  is  closelv  related  to  the  strict  eouivalencv  of 
certain  reoular  pencils. 

Lemma  1 . 1 5 . a .  Let  A , »  £  **  ( D )  and  associate  with  these  matrices  the  following  regular 
pencils 


(1.15.5) 


A(x)  =  -A+xI,  "(x)  =  -n+xT  . 


equivalent. 

Proof.  Assume  first  that  A  *  B.  Then  (1.15.2)  implies  (1.14.4)  where  P  =  Q_1.  Suppose 

now  that  A(x)  ~  B(x).  So 

s 


8  -  QAP.  QP  -  I  . 


That  is  P  =  Q-1  and  A  *  B. 

□ 

Clearly  is  A(x)  ~  B(x)  then  A(x)  ~  B(x).  So  we  have 
s 

Corollary  1.15.6.  Let  A,B  t  M^fD).  Assume  that  D  is  a  unique  factorization  domain. 
Assume  that  A  and  B  are  similar  then  the  corresponding  pencils  A(x)  and  B(x)  given 
by  (1.15.5)  have  the  same  invariant  polynomials. 

In  case  that  D  -  F  the  above  condition  is  also  a  sufficient  condition  in  view  of 
Lemma  1.15.4  and  Corollary  1.14.18. 

Theorem  1.15.7.  Let  A,B  e  NMF).  Then  A  and  B  are  similar  if  and  only  if  the 
pencils  A(  x)  and  B(x)  given  by  (1.15.5)  have  the  same  invariant  polynomials. 

It  can  be  shown  (see  Problem  1.15.8)  that  even  over  Euclidean  domains  the  condition 
that  A(  x)  and  B(x)  have  the  same  invariant  polynomials  does  not  imply  in  general  that 
A  *  B. 

Problems 
(1.15.8)  Let 


A  =  <„ 


.),  B 


<0  5  >  6  M2(Z) 


Show  that  A( x)  and  B(x)  given  by  (1.15.5)  have  the  same  invariant  polynomials  over 


Z!x].  Prove  that  A  and  B  are  not  similar  over  Z 


(1.15.9)  Let  A(x)  e  M  (OIDIxl  )  be  given  by  (1.15.5).  Let  i , ( x) , . . . , i  ( x)  be  the 
n  in 

invariant  polynomials  of  A{ x) .  Using  the  equality  (1,11.20)  prove  that  each  ik(x)  can 
be  assumed  to  be  normalized  polynomial  and 


|  (1.15.10)  £  deg  i  (x)  =  n  • 

l  k-1  k 

! 

* 

j 

•  (1.15.11)  Let  A  e  M  ( F) .  Show  that  A  35  At . 

I  n 


» 


.1 
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1.16  The  companion  matrix 

Theorem  1.15.7  shows  that  the  invariant  polynomials  of  xI-A  determine  the  similar  it 
class  of  A.  We  now  show  that  any  set  of  normalized  polynomials  i^(x),*.*,i(x)  UFD'x' 
such  that  i j( x) | i ^+1 ( x) #  j  =  1,...,n»1  and  which  satisfy  (1.15.10)  are  invariant 
polynomials  of  xI-A  for  some  A  e  Mn(UFD).  To  do  so  we  introduce  the  notion  of  a 
companion  matrix. 

Definition  1.16.1.  Let  p(x)  e  D[x]  be  a  normalized  polynomial 


p(  x)  =*  xm  +  a1xm“1  +  . . . 


Then  C(p)  =  (c..)„  e  M  (D)  is  called  the  comoanion  matrix  of  p(x)  if 
-  i]  1  m - * -  — 


(1.16.2)  c.  .  »  6, .  i  -  1,...,m-1,  j  =  1,...,m,  c  .  =  -a  ,  j  -  1 , .  * . ,  m 

lj  ( i+1 ) 3  mi  m-j  +  1  J 


Lemma  1.16.3.  Let  p(x)  e  UFD(x]  be  a  normalized  polynomial  of  degree  m.  Consider  the 
pencil  C(x)  -  xl  -  C(p).  Then  the  invariant  polynomials  of  C(x)  are 


(1.16.4) 


i 1 ( C )  *...*im-1(C)  =  1,  i m(C)  =  p(x)  . 


Proof.  For  k  <  m  consider  a  minor  of  C(x)  composed  of  the  rows  1,...,k  and  columns 

2,...,k+1.  Since  this  minor  is  the  determinant  of  a  lower  triangular  matrix  with  -1  or* 

the  main  diagonal  we  deduce  that  its  value  is  (-1)^.  So  6  (C(x))  =  1,  k  =  1,...,m-l. 

k 

This  establishes  the  first  equality  in  (1.16.4).  Clearly,  <5  (C(x))  =  |xI-C|.  Expand  t’*  » 

m 

determinant  of  C(x)  by  the  first  row  and  use  the  induction  hypothesis  to  prove 

|xI-C)  =  p(x).  This  shows  i  (C)  =  6  (C)/6  (C)  -  p( x) • 

m  m  m-1 


Using  the  results  of  Problem  1.12.10  and  Lemma  1.16.3  we  aet 
Theorem  1.16.5.  Let  p_.(x)  c  UFD[x|,  i  *  1,...,k  be  normalized  polynomials 
degrees  such  that  p . ( x) )p .  . ( x) ,  1  ~  1,...,k-1.  Consider  the  matrix 


(1.16.6) 


k 

C(p - -  )  =  I  9  C(p  )  . 

j=1  3 


Then  the  non-trivial  invariant  polynomials  of  xl  -  C( p 1 , • . . ,  p^)  (i.e.  those  polynomials 

which  are  not  the  identity  element)  are  p^ ( x) , . . . .p^f x) . 

Combining  Theorems  1.15.7  and  1.16.5  we  obtain  a  canonical  representation  for  the 
similarity  class  in  Mn(F).  __  - - - 

Theorem  1.16.7.  Let  A  e  M  (F)  ^'aWTr  assume  that  Pj(x)  e  F[x) ,  j  =  1,...,k  are  the  non¬ 
trivial  normalized  polynomials  of  xI-A.  Then  A  is  similar  to  C(p1 , . . . .p^) . 

Definition  1.16.8.  For  A  e  M^(F)  the  matrix  C( p^ , . . . .p^)  is  called  the  rational 
canonical  form  of  A. 

Let  D  be  an  integral  domain  and  denote  by  F  its  quotient  field.  Let  A  e  Mn(D). 

So  A  e  M^(F)  and  let  C(p1 , . . . ,pk)  be  the  rational  canonical  form  of  A.  We  now  examine 
the  case  when  A( p^ , . . . .p^ )  e  M^(D).  Assume  first  that  D  is  UFD.  Let  6^  be  the 
g.c.d.  of  k  x  k  xI-A.  So  6^  divides  a  minor  of  p(x)  »  (xI-A)[a|a),  a  *  {l,...,k}. 
Clearly  p(x)  is  a  normalized  polynomial  of  degree  k.  Recall  that  D[x]  is  also  UFD. 
(See  Section  1.4.) 

According  to  Theorem  1.4.12  the  decomposition  of  p(x)  into  irreducible  factors  in 
D(x]  if  of  the  form  (1.4.9)  where  a  =  1  and  each  qi(x)  is  a  non-trivial  normalized  and 
irreducible  polynomial  in  D[x] .  Since  6^  is  a  product  of  some  irreducible  factors  of 
p(x)  then  either  6^  =  1  or  6^  is  a  non-trivial  normalized  polynomial  in  D[x]  .  The 
same  argument  shows  that  i^  =  6^/6^^  is  either  identity  or  a  non-trivial  polynomial  in 
D[x).  Thus  we  demonstrated. 

Theorem  1.16.9.  Let  A  e  M  (D).  Assume  that  D  is  a  unique  factorization  domain.  Then 
the  rational  canonical  form  C(p1 , . . . .p^l  over  the  quotient  field  F  belongs  to  Mn(D). 

In  particular,  we  have 

Corollary  1.16.10.  Let  A  e  (C[x^ , . . . ,x  ] ) .  Then  the  rational  canonical  form  A 
belongs  to  M^tCf x1 , . . . ,xn) ) . 
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Using  the  results  of  Theorem  1.4.13  we  deduce  that  Theorem  1.16.9  applies  to  the  ring 


t 

t 


of  analytic  functions  in  several  variables  although  this  ring  is  not  UFD  (see  Section 
1.3). 

Theorem  1.16.11.  Let  A  e  M  (H(Sl))(fi  £  Cm).  Then  the  rational  canonical  form  of  A  over 
the  field  of  meromorphic  functions  belongs  to  M^(H(0)). 

Problems 

(1.16.12)  Let  p(x)  e  UFDfx]  be  a  normalized  non-trivial  polynomial.  Assume  p(x)  * 
p^xjpjfx).  where  p^(x)  is  normalized  non-trivial  polynomial  in  UPD[x)  for  i  *  1,2. 
Using  Problems  1.12.9  and  1.12.10  show  that  xl  -  C(p^,P2>  given  by  1.16.6  has  the  same 
invariant  polynomials  as  xl  -  C(p)  if  and  only  if  (p^pj)  “  1. 

(1.16.13)  Let  A  e  M^(UFD)  and  assume  that  p1 (x) , . . . ,pk( x)  are  the  non-trivial 
normalized  invariant  polynomials  of  xI-A.  Let 

(1.16.14)  Pj(x)  =  (  x)l  '  -1 .  .  .  tv>4(  x)  ]  3,  j  -  1 , . . .  ,k 


where  * ( x) , . . . ( x)  are  non-trivial  normalized  irreducible  polynomials  in  DFD[x]  such 
that  j)  =  1  for  i  ¥  j.  Prove  that 


(1.16.15) 


"ik  *  1  *  1 . *'  "ik  >  mi(k-1 )* "  ’>m11  *  °  ' 

£,k 

I  “  n  ’ 

i,j-1  13 


The  polynomials  *  for  mij  >  0  are  calletl  the  elementary  divisors  of  xI-A.  Using 

the  above  problem  show  that  xI-A  and  xI-E  where 


(1.16.16) 


E  =  V  9  C(>r 
-  1 
m,  .>0 
i  3 


have  the  same  invariant  polynomials.  Thus  over  a  field  F  A  25  E.  Sometimes  E  is  called 
the  rational  canonical  form  of  A. 
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1.17  Splitting  to  invariant  subspaces 

Let  V  be  a  vector  space  of  dimension  m  over  a  field  F.  Denote  by  L(V]  the 
vector  space  of  all  linear  transformations  T  :  V  +  V.  That  is 

(1.17.1)  LfV)  -  Hom(V, V)  . 

Let  T  £  L(V).  As  we  pointed  out  in  Section  1.15  the  set  of  all  matrices  A  £  M^P)  which 
represent  T  in  different  bases  is  exactly  an  equivalence  class  of  matrices  with  respect 
to  the  similarity  relation.  Theorem  1.15.7  shows  that  the  class  A  is  characterized  by 
the  invariant  polynomials  of  xI-A  for  some  A  e  A.  Since  xI-A  and  xI-B  have  the  same 

invariant  polynomials  if  and  only  if  A  %  B  we  define. 

Definition  1.17.2.  Let  T  £  L(V)  and  let  A  £  M^tF)  be  a  representation  matrix  of  T  in 
a  basis  u 1 , . . . , u1"  given  by  the  equality  (1.15.3).  Then  the  invariant  polynomials 
Pi (x) , . . . »pm(x)  c>f  T  are  defined  as  the  invariant  polynomials  of  xl-A.  The 
characteristic  polynomials  of  T  -  is  the  polynomial  |xI-A| . 

The  fact  that  the  characteristic  polynomial  of  T  is  independent  of  a  representation 
matrix  A  follows  from  the  identity  (1.11.20) 

(1.17.3)  |xI-A|  *  p1 (x) . . .p^l x) 

where  p^ (x) , . . . ,p^{ x)  are  non-trivial  invariant  polynomials  of  xl-A.  In  Section  1.16  we 

proved  that  the  matrix  C( p, , . . . ,p^)  is  a  representation  matrix  of  T.  In  this  section  we 

shall  consider  another  representation  matrix  A  of  T  which  is  closely  related  to  the 
matrix  E  (1.16.16).  This  form  will  be  achieved  by  splitting  V  to  a  direct  sum 

(1.17.4)  V  -  U.e  M), 

1  i 

where  each  llj  is  an  invariant  subspace  of  T. 

Definition  1.17.5.  A  subspace  U  £V  is  an  invariant  subspace  of  T  1  f 


r 
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(1.17.6) 


Hie  U  . 


0  is  called  trivial  if  U  ■  [0].  0  is  called  proper  if  0  |  V.  0  is  called  irreducible 

if  U  cannot  be  expressed  as  a  direct  sum  of  two  non-trivial  invariant  subspaces  of  T. 

Thus  if  V  splits  to  a  direct  sum  of  non-trivial  invariant  subspaces  of  T  then  a 
direct  sum  of  matrix  representations  of  the  restrictions  of  T  to  0^  gives  a 
representation  matrix  of  T.  So,  a  simple  representation  of  T  can  be  achieved  by 
splitting  V  to  a  direct  sum  of  irreducible  invariant  subspaces.  To  do  so  we  need  to 

introduce  the  notion  of  the  minimal  polynomial  of  T.  Consider  the  linear  operators 
2  m2 

I,T,T  ,...,T  ,  where  1  is  the  identity  operator  (Ix«x).  Since  the  dimension  of  L(V) 

2  2 

is  m  these  m  +  1  operators  are  linearly  dependent.  So  there  exists  an  integer  q 
such  that  I,T, . . .  ,T<3"'1  are  independent  and  I.T,...,^  are  linearly  dependent. 

Definition  1.17.7.  A  polynomial  i(i(x)  £  F[x]  is  called  the  minimal  polynomial  of  T  if 


i|i(x)  la  a  normalized  polynomial  of  the  smallest  degree  satisfying 


(1.17.8) 


1|>(T)  •  0  . 


i  1  i 

XT)  =  l  c  r,  <J>(x)  -  l  c  X  e  F[x) 
i-C  1  i-0 


and  0  is  the  zero  operator  (0x”0).  By  the  definition  deg  ij/  >  1 .  The  minimal 
polynomial  is  characterized  by  the  following  property. 

Lemma  1.17.9.  Assume  that  T  annihilates  d  e  F[x].  That  is  p(T)  =  0.  Then 


Proof .  Divide  ip  by  ip 


$(x)  »  x(x'4Xx)  +  p(x),  deg  p  <  deg 


Now  (1.17.8)  and  the  assumption  of  the  lemma  imply  that  p(T)  =  0.  As  deg  p  <  deg  l|/  from 


the  definition  of  the  minimal  polynomial  we  deduce  that  p(x)  =  0. 

□ 

Since  F[x]  is  a  unique  factorization  domain,  let 

S1  S£ 

(1.17.10)  ij/(x)  =  v'^x)  •••v'i(x>  ,  <*if*  )  =  1,  for  1  <  i  <  j  <  l, 

deg  V  >  1,  i=  1 , .. .  ,Jl 

where  each  ^(x)  is  a  normalized  irreducible  polynomial  in  F[x]. 

Theorem  1.17.11.  Let  i(i(x)  be  the  minimal  polynomial  of  T.  Assume  that  ^  splits  to  a 

product  of  co-prime  factors  as  given  in  (1.17.10).  Then  the  space  V  splits  to  a  direct 

s . 

sume  (1.17.4)  where  each  is  a  non-trivlal  invariant  subspace  of  T.  Moreover  'f  ^(x) 

is  the  minimal  polynomial  of  the  restriction  of  T  to  . 

The  proof  of  the  theorem  follows  immediately  from  the  lemma  below. 

Lemma  1.17.12.  Let  iji  be  the  minimal  polynomial  of  T.  Assume  that  splits  to  a 
product  of  two  co-prime  factors 


(1.17.13) 


iji(x)  »  ^  ( x) < x)  ,  deg  ^  >  1,  i  »  1,2,  (V',  ,<'2 >  *  1 


where  each 


is  normalized.  Then 


(1.17.14) 


V  -  0,  0  u2  , 


where  each  Pj  is  a  non-trivlal  Invariant  subspace  of  T  and  is  the  minimal 

polynomial  of  the  restriction  of  T  _to  . 

Proof.  The  assumption  of  the  lemma  imply  the  existence  of  polynomials  8^(x)  and  0^(x) 
sucht  that 


(1.17.15) 


91  ( x)i|»j  (x)  +  S2  (x)Y2(x)  =  1  . 
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Define 


(1.17.16)  -  (ulu  e  V,  <yT)u  -  0},  j  -  1,2  . 

Since  any  two  polynomials  in  T  commute,  i.e. 

U(T)v(T)  -  v(T)u(T) 

we  clearly  have  that  each  0^  is  an  invariant  subspace  of  T.  The  equality  (1.17.15) 
implies 

X  -  V,  (T)81  (T)  +  V2(T)82(T)  . 

That  is,  for  any  u  c  V  we  have 

u  -  u  ♦  u2,  u  -  -,2<T>e2(T)u  c  -  ^(Tje^TJu  e  02  . 

So 

U,  +  u2  -  v  . 

Suppose  that  u  c  '  U2.  Then 

V1  (T ) u  -  *2  (T)u  ■  0  . 

Thus 


S,(T)v,(T)u  -  8  (T)i>  (T)u  -  0  . 


Finally 


u  *=  f  3t  (T>^1  (T )  +  92<T)*2<T)!u  =  0 

which  proves  that  n  D2  =  [01.  This  establishes  (1.17.14).  Let  Tj  be  the  restriction 
of  T  of  .  By  the  definition  of  (1.17.16)  T  j  annihilates  if^..  Let  v ^  be  the 

minimal  polynomial  of  T^.  So  1  'f' j »  j  =  1,2.  Now 

4»1  <T  >^2  (T)u  =  (T  )<j»2  <T  )  (U1  +  a,)  =  ^  (T ) ^  (T )  ul  +  ^  (T)^  (T )  =  0  . 

Therefore  (1.17.14)  yields  that  T  annihilates  i|i^  i|;  .  Since  if(x)  is  the  minimal 

polynomial  of  T  we  have  if  i^2 1  ^  ■  This  finally  implies  if^  *  if^ ,  j  =  1,2.  Also  as 

deq  0  >  1  it  follows  that  dim  O.  >  1. 

'  1  1 

Problems 

(1.17.17)  Assume  that  (1.17.14)  holds,  where  TUj  2_  ,  j  =  1,2.  Let  if^  be  the  minimal 

polynomial  of  the  restriction  of  T  to  Uj ,  j  =  1,2.  Prove  that  the  minimal  polynomial 

of  T  is  equal  to  i{i  Vj/<  '  • 

(1.17.18)  Let  the  assumptions  of  Problem  (1.17.17)  hold.  Assume  furthermore  that  v  - 
where  *  is  irreducible  over  F(xl.  Then  either  if  =  if  or  V  =  if^. 

(1.17.19)  Let  C  «  C(p)  c  M  (D)  be  the  companion  matrix  qiven  by  (1.16.2).  Let 

nt 

e.  =  (5,.,...,S  1 ,  i  »  1,...,m,  be  a  standard  basis  in  Dm.  Show 

l  il  lm 


(1.17.20) 


i-1 


-  a  E  , 

m-i+1  m 


1 , . . . ,m  (c 


0) 


Prove  that  p(C)  =  0  and  that  any  polynomial  0  *  q(x)  £  D(xl,  dec  q  <  m,  is  not 
annihilated  by  C.  (Consider  q(C)t'^  and  use  (1.17.20).)  That  is  p(x'  is  the  minimal 
polynomial  of  C(p). 
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(1.17.21)  Let  ACM  (F).  Using  Theorem  1.16.7  and  Problems  1.17.17  and  1.17.19  show  th 

m 

the  minimal  polynomial  tj;  of  A  is  the  last  invariant  polynomial  xI-A.  That  is 

(1.17.22)  H>(x)  -  |xI-A|/6  ,(x)  , 

m-  7 

where  6  (x)  is  g.c.d.  of  all  (m-1)  *  (m-1)  minor  of  xI-A. 

Rl—  1 

(1.17.23)  Show  that  the  results  of  Problem  1.17.22  apply  to  ACM  (OFD).  In  Darticular 

Itl 

if  A  B  then  A  and  B  have  the  same  minimal  polynomials. 

(1.17.24)  Deduce  from  Problem  (1.17.21)  the  Cayley-Hamllton  theorem  which  states  that 
T  e  L(V)  annihilates  its  characteristic  polynomial. 

(1.17.25)  Let  ACM  (D).  Prove  that  A  annihilates  its  characteristic  polynomial. 

m 

(Prove  this  result  by  considering  the  quotient  field  of  D. ) 

(1.17.26)  Use  Problem  (1.17.24)  and  Lemma  1.17.9  to  show 

(1.17.27)  deg  <|i  <  dim  V  . 

(1.17.28)  Let  i(i  -  V3  where  v*  is  irreducible  in  F[x)  and  assume  that  deg  v  *  dim  V 
Use  Problem  (1.17.18)  and  (1.17.26)  to  show  that  V  is  an  irreducible  invariant  subspace 
of  T. 

(1.17.29)  Let  p(x)  e  F[x]  be  a  non-trivial  normalized  polynomial  such  that  p  =  «8 
where  *(x)  is  irreducible  in  F(xJ.  Let  T  e  L(V)  be  represented  by  C(p).  Use  Froble 
(1.17.28)  to  prove  that  V  is  an  irreducible  invariant  subspace  of  T. 

(1.17.30)  Let  T  c  L(V)  and  let  E  be  the  matrix  given  by  (1.16.16'  which  is  determine 
by  the  elementary  divisors  of  T.  Using  Problem  (1.17.29)  prove  that  the  representat ion 
E  of  T  corresponds  to  a  splitting  of  V  to  a  direct  sume  of  irreducible  invariant 
subspaces  of  T. 

(1.17.31)  Deduce  from  Problems  (1.17.28)  and  (1.1'. 30'  that  V  is  an  irreducible 

satisfies  the  assumptions  o' 


invariant  subspace  if  and  only  if  the  minimal  polynomial 


1.18  An  upper  triangular  form 

Definition  1.18.1.  Let  M  be  a  D-module  and  assume  that  T  e  Hom(M,M).  A  e  D  is  called 
an  eigenvalue  if  there  exist  0  f  u  e  M  such  that 

(1.18.2)  Tu  =  Au  . 

The  element  (vector)  u  is  called  eigenelement  (eigenvector)  corresponding  to  A.  An 
element  0  ^  u  is  called  generalized  eigenelement  (eigenvector)  if 

(1.18.3)  <AI-T)ku  =  0 

for  some  positive  integer  k  where  A  is  an  eigenvalue  of  T.  For  T  £  M^ID)  ^  is 
called  eigenvalue  if  (1.18.2)  holds  for  some  0  f  u  e  Dm.  The  element  u  is  called 
eigenelement  (eigenvector)  or  generalized  eigenvector  if  (1.18.2)  oi_  (1.18.3)  holds 
respectively . 

Lemma  1.18.4.  Let  T  £  M^tD) .  Then  A  is  an  eigenvalue  of  T  if  and  only  if  A  is  a 
root  of  the  characteristic  polynomial  |xX-T| . 

Proof.  Let  F  be  the  quotient  field  of  D.  As  (1.18.2)  is  equivalent  to 

(Al-T)u  =  0 

by  the  definition  u  /  0,  so  the  above  system  has  a  non-trivial  solution.  Therefore 
| AX  — T |  =  0.  That  is  A  is  a  root  of  the  characteristic  polynomial  of  T.  Vice  versa,  if 
| A I  — T |  =  0  then  the  above  system  has  a  non-trivial  solution  u  £  Fm.  Clearly  au,  a  E  D 
also  satisfies  the  above  equality.  Choose  a  f  0  such  that  au  £  D™.  Thus  A  is  an 
eigenvalue  of  T. 

Theorem  1.18.5.  Let  T  c  M^(D).  Assume  that  the  characteristic  polynomial  of  T  splits 
to  linear  factors  over  D 
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Clearly,  the  upper  triangular  form  of  A  is  not  unique  unless  A  =  al.  See  Problem 


1.18.9.  In  what  follows  we  shall  use  the  definitions  given  below. 

Definition  1.18.9.  Let  T  £  M^(D)  and  assume  that  (1.18.6)  holds.  Then  the  spectrum  of 
T  -  o(T)  is  defined  to  be  the  set 


(1.18.10) 


o (T )  =  (A, , ♦. . , A  } 

t  m 


By  denote  the  multiplicity  of  that  the  number  of  times  that 

a(T).  The  eigenvalue  A^  is  called  algebraically  simple  if  itk  =  1. 
all  the  distinct  eigenvalues  of  T.  That  is 


A .  is  appearing  in 
Let  A i , . . • , A  be 


£ 

(1.18.11)  l  m.  =  m,  \  c  o(  T),  i  «  1 , . . . ,  C  . 

i=1  1  1 

By  o^fT)  we  denote  the  distinct  spectrum  of  T 
(  1.18.12)  od(T)  =  U  . 

Problems 

(1.18.13) .  Let  Q  correspond  to  the  elementary  row  operation  described  in  Definition 
1.10.11  -  (iii).  Assume  that  A  is  an  upper  triangular  matrix.  Show  that  if  j  <  i 
then  QAQ-1  is  also  an  upper  triangular  matrix. 

(1.18.14)  Prove  that  if  T  e  M  (D)  is  similar  to  an  upper  triangular  matrix  A  e  M  (D) 

m  m 

then  the  characteristic  polynomial  of  T  splits  to  linear  factors. 

(1.18.15)  Let  T  e  M  (D)  and  put 

m 

m 

(1.18.16)  Ixl-T|  =  x"  +  V  a  V  . 

jii 
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Assume  that  the  assumptions  of  Theorem  1.18.5  hold.  Show  that 


(1.18.17) 


(-1  )  ^  =  l  Tto|o)  =  *  (X  , 
aE2k(m 


in 


where  s^(  x.,  , . . . ,  xm)  is  the  k-th  symmetric  polynomial  of  The  coefficient 

-a1  is  called  the  trace  of  A(tr(A)).  That  is 


(1.18.18) 


m  m 

tr(A)  »  I  a  -  l  X  . 
i-1  i=l 


(1.18.19)  Let  T  e  M  (D)  and  suppose  that  the  assumptions  of  Theorem  1.18.5  hold.  Assume 
m 

that  D  is  OFD.  Using  the  results  of  Theorem  1.18.5  and  Problem  1.17.23  prove  th»t  the 
minimal  polynomial  i |i(x)  of  T  is  of  the  form 


(1.18.20)  *<x)  -  n  <x-X  )  ,  X  *  X  for  i  *  j,  1  <  s  <  m  ,  i,j  -  1,...,t 

i-1  1  1  3  11 


where  o.(T)  »  {X . , . . .  ,A . } .  (Hint!  Consider  the  diagonal  elements  of  \|/(A).) 
d  lx 

(1.18.21)  Let  T  E  M  (UFD)  and  assume  that  the  minimal  polynomial  of  T  is  given  by 
m 

(1.18.20).  Using  Problem  1,17.21  and  the  equality  (1.17.3)  prove 


(1.18.22)  Ixl-T|  -  n  (x-X  )  .  X  /  X  for  i  j  j.  i,j  -  1, 

i-1  X  13 


1.19  Jordan  canonical  form 


i 

\ 


Theorem  1.18.5  and  Problem  1.18.14  shows  that  T  e  M  (D)  is  similar  to  an  upper 

m 

triangular  matrix  A  if  and  only  if  the  characteristic  polynomial  of  T  splits  to  linear 
factors.  Unfortunately  the  upper  triangular  form  of  T  is  not  unique.  In  case  that  D  is 
a  field  there  exists  an  upper  triangular  matrix  A  which  depends  only  on  the  eigenvvalues 
of  T  and  this  matrix  is  essentially  unique.  For  convenience  we  state  the  theorem  of  an 
operator  T  e  L(V). 

Theorem  1.19.1.  Let  T  c  L(V).  Assume  that  the  minimal  polynomial  ij»(x)  _of  T  splits  to 
a  product  of  linear  factors  as  given  in  (1.18.20).  Then  V  splits  to  a  direct  sum  of  non¬ 
trivial  irreducible  invariant  subspaces  of  T 


( 1. 19.2) 


V  =  w  «•••«» 

i  q 


In  each  subspace  W  it  is  possible  to  choose  a  basis  consisting  of  generalized 
eigenvectors  x^,...,xr  such  that 


(1.19.3) 


Tx1  =  ^x1  , 


(1.19.4) 


k+ 1  k+ 1  k 

Tx  =  X^x  +  x  ,  k  =  1,...,r  -  1 


where  X„  is  equal  to  X.  for  some  i  and  r  <  s.  (in  case  that  r  =  1  (1.19.4)  is 

void) .  Moreover  for  each  X^(  i  =  1, there  exists  an  invariant  subspace  of  W 

whose  basis  satisfies  (1.19.3)  -  (1.19.4)  with  X„  =  X,  and  r  =  s, • 
-  -  0  l  -  i 

Proof .  Assume  first  that  the  minimal  polynomial  of  T  is  of  the  form 

(1.19.5)  x)  »  xS  . 

So  Ts  =  0  and  Ts_1  f  0.  Let  x,1,...,x  1  span  the  range  of  Ts_1 


TS_1V  -  [x11 . x1"1] 


In  particular  x  are  linearly  independent.  Let  x*  be  the  pre- 


images  of  x31,...,x  '  for  the  map  T0-':  V  *  V.  So 


In 


,s— 1 


T8-1x83  -  x’i,  j  -  1 . n,  . 


Denote 


^s-kjj  ,  T*xsi,  k  -  1,...,s  -  2,  j  »  . 


As 


Ts  •  0  we  have  Tx1 3  a  o,  j  -  1,...,n..  The  two  equalities  above  are  equivalent  to 


(1.19.6) 


Tx1 ^  -  0 ,  Tx 


(k+1)j  m  xhj(  ^  ,  1,,.,,s  -  1,  j  *  1,...,n1 


we  claim  that  x11,...,x  1 , . . . ,xa ' , . . . ,x  '  are  linearly  independent.  Indeed,  suppose 

that 


In. 


si 


‘1 


I  I  v^-o  . 

i-1  j*=1  13 


1 

1  i 

u 


Apply  T8”1  to  this  equality.  The  (1.19.6)  yields 


l  8sjx13  -  0 

J-1  3 


11  »•»  4  g-2 

As  x  ,  . . • , x  are  linearly  independent  ag1  *...*a8n^  ■  0*  Next  apply  T  to 

obtain  a,  -  0.  Continuing  in  the  same  manner  we  deduce  that 

l s- 1 l 1  l S  « ini 
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i  -  1 


s,  j  -  1 


n1  are  linearly  independent.  Assume  now  that  we  have  found 


x  I 


linear  independent  vectors 


1,...,s-r+1,  j  =  mr_ 1 + 1 , • • • ,mr,  r  «  1,...,p  , 


(1.19.7) 


mg  =  0,  mr  =  n1+...+nr,  r 


1,...,p,  n  >  1,  n  >  0,...,n  >  0 

1  2  p 


for  1  <  p  <  s,  such  that  (1.19.6)  holds  for  k  =  1,...,s-r,  j  =  m  ^  +  1, . . . ,mr, 
r  =  1 , . • • ,p  and 


(1.19.8) 


-P  . 


V  =  [x 


11 


1m. 


.  ,x 


■P1 . 


pm^  1  ( m^  + 1 ) 


Im, 


(p-1)(m1+1)  (P-1)*2  1 (m  -+1 )  1m 

X  f  •  •  •  ,  X  P  1  Pi 

»•••("  »•••»«  I  • 


For  p  *  s  we  found  the  needed  basis  of  V.  For  p  <  s  consider  the  subspace  Vs_p-1 


rs-P-1V.  Clearly 

2<V1+1)  x2mP 


11 


1m, 


x!p+1 ) 1  _{^1,»1 


'V1 


1mp 

»x  P, 


belong  to  Recall  that  these  vectors  were  assumed  to  be 

linearly  independent.  If  the  above  vectors  span  Vs_p-1  put  rp+)  =  0,  mp+i  "  mp. 
Otherwise,  let 


vs-p-1 


[x11 . x1m’ . x'P*1'1 . x(P+1)m’ . x1(mP-1  +  1> . x,mP 


2 ( m  , + 1 )  2m  1(m+1) 

X  P  ’  ,  .  .  .  ,  X  P,  X  P 


1m  , 
,X  P+1l 


Since  Tx'3  €  V  ,  j  =  m  +  1,...,m  .  in  view  of  (1.19.8)  and  (1.19.6)  we  can 

s-p  J  p  p+1 

assume  that  Tx1  3  *  0,  j  *  mp  +  1,...,m^.j  (by  adding  to  x13  a  linear  combination  of 
vectors  appearing  in  (1.19.7)1.  Denote  x<B“P)j  the  pre-images  of  x1  3  for  the  map 
Ts-p-1,  y  *  v  for  j  =  m  +1,...,mpM.  Also  let  x<  s— p-k )  j  _  Tkx<s-p)jf  k  ,  1,...,s-p-2 
Thus  (1.19.6)  holds  for  k  *  1,...,s-r,  j  =  m^^  +  l , . . .  ,mr ,  r  =  1,...,p+1.  We  claim  that 


the  vectors  appearing  in  (1.19.7)  for  r  -  1,...,p+1  linearly  independent.  This  follows 
by  applying  , . . . .T1 ,  T°  -  I  to  the  given  vectors  and  using  the  identities  (1.19.6) 

for  all  the  involved  vectors  and  taking  in  account  that  the  vectors  which  span  Vg_p_.j  are 
linearly  independent.  For  each  p  auch  that  mp  >  mp_1  let 


,x(#-p+1)3J,  m  +  1  <  j  <  m 

p-l  p 


So  (1.9.3)  -  (1.9.4)  holds  ior  »  0  and  r  -  a-p+1.  As  all  x13,  i  -  1,...,s-r+1, 

J  -  mr_1  +  1,...,mr,  r  ■  1,...,s,  the  equality  (1.19.2)  holds.  Also  m1  »  n1  >  0  and 
dim  W1  ■  s.  It  is  left  to  show  that  ia  an  irreducible  invariant  subspace  of  T.  From 

the  equalities  (1.19.6)  we  get  that  if  x'a  p+1'3  e  0,  where  0  is  an  invariant  subspace 
of  Wj  than  x*3  e  U  for  k  ■  a-p+1,...,1.  So  0  “  Wj  and  is  irreducible.  This 

proves  the  theorem  in  case  that  the  minimal  polynomial  of  T  is  of  the  form  (1.19.5). 
Assume  that  the  minimal  polynomial  of  T  ie  of  the  form  (1.18.20).  According  to  Theorem 
1.17,11  V  splits  to  the  direct  sum  of  non-trlvial  invariant  subspaces  (1.17.4)  such  that 
the  minimal  polynomial  of  Tj  -  the  restriction  of  T  to  is  (x-\^)  J.  Call 


')  •  V  -  W  V  ’ . *  • 

s. 

Clearly,  the  minimal  polynomial  of  Bj  is  x  J.  So  we  can  apply  our  results  for  Qj, 
j  -  1,...,i.  This  immediately  Implies  the  theorem. 

□ 

Let  H(n)  be  the  following  n  *  n  matrix 


(1.19.9)  H(n)  -  (hij)",  htj  -  «(i+  ,  i,j  -  1 . .  . 

That  is  H(n)  is  0-1  matrix  of  the  form 
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H(n) 


1  0  .  0 

0  1  .  0 

0  0  .  0 

0  0  .  0 


We  shall  also  denote  H(n)  by  Hn  or  simply  H  in  case  that  the  dimension  of  H  is  well 
defined.  Let  V  =  [v1,...^1]  when  v*  =  xr_X+\  i  =  1,...,r  and  x\...,xr  satisfy 
(1.19.3)  -  (1.19.4).  Then  in  this  basis  T  is  represented  by  a  Jordan  block  X^I  +  H  of 
dimension  r.  Thus  Theorem  1.19.1  implies: 

Theorem  1.19.10.  Let  A  c  NMF).  Assume  that  the  minimal  polynomial  iKx)  A  splits 

to  linear  factors  as  given  in  (1.18.20).  The  there  exists  a  non-singular  matrix 

P  £  M  (F)  such  that 
n  - 

1  4  ^ 

P  AP  =  J,  J  =  I  T  «  (X.I(m. .)  +  H ( m  .))  , 

(1.19.11)  i-1  j=1  1  1]  13 

1  <  m.  <  m.  .  <...<  m. .  =  s.  ,  i  *  1, ...,£,  X.  f1  X.,  for  i  /  j 

lqi  lqi*’  ll  i  i  j 

Definition  1.19.12.  The  matrix  J  appearing  in  (1.19.11)  is  called  the  Jordan  canonical 
form  of  A(T).  The  polynomials 

m.  . 

(1.19.12)  v'ij(x)  =  <x_Xi  •  13>  3  ”  i  =  1 . 1  , 

are  called  the  elementary  divisors  of  ACT). 

Remark  1.19.13.  In  case  that  the  minimal  polynomial  of  A  does  not  split  to  linear 
factors  in  F  we  can  find  a  finite  extension  field  K  such  that  ^  splits  in  K.  Then 

(1.19.11)  holds  for  P  £  M  (K).  We  shall  refer  to  J  as  the  Jordan  canonical  form  of  A. 

n 

Theorem  1.19.14.  Let  A  r  M^(F).  Assume  that  the  minimal  polynomial  of  A  is  of  the  form 
given  by  (1.18.20).  The  the  elementary  polynomials  of  A  are  the  elementary  divisors 


of  xI-A  defined  in  Problem  1.16.13.  That  is,  put 


(1.19.15) 


m ,  ,  “..."in  *  0 ,  1  —  I.... 

iq^+1  in 


Than  the  invariant  polynomials  i1 ( x) , . . . , in( x)  £f  xI-A 


(1.19.16) 


i  <x)  -  n  (x-X  )  1(n~r+1),  r  -  T 
r  i-1  1 


,i  a 


are  given  by  the  equalities 


n 


In  particular  if  p,  ( x)  , . . .  ,pk(  x)  are  the  non-trlvlal  invariant  polynomials  of  xI-A  then 


i  1 

(1.19.17)  p  (x)  -  n  (x-X  )  j  -  1 . k  . 

i-1  1 

Proof.  Assume  first  that  A  »  XQI(»)  +■  Him)  .  Then  for  1  <  k  <  m-1  the  minor  of 

xI-A  composed  of  the  rows  1,...,k  and  the  columns  2,,..,k+1  is  equal  to  (-I)*.  So 

the  first  m-1  determinant  Invariants  are  trivial.  Also  5  (xI-A)  -  |xI-A|  «  (x-X„)™. 

m  0 

Hence 

i  qi  m 

xI-A  ~  xI-J  ~  l  l  9  diag(1,...,1,  (x-X  >  3)  . 

i-1  j-1 

Applying  the  results  of  Problems  (1.12.9)  -  (1.12.10)  we  deduce  the  equality  (1.19.16). 
Clearly  (1.19.16)  is  equivalent  to  (1.19.17). 

□ 

This  theorem  shows  that  the  Jordan  canonical  form  of  A(T)  is  unique  up  to 
permutation  of  Jordan  blocks. 

Problems 

(1.19.18)  Show  directly  that  to  each  eigenvalue  X  of  a  companion  matrix  C(p) 

o 

corresponds  one  linear  independent  vector  of  the  form  ( 1  ,Xo, . . .  ,Xon  S4. 
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fc’,  (eo-  ■  * 


(1.19.19)  Let  A  c  M^(C)  .  Assume  that  X  e  0(A).  Let  £  C°  be  the  subspaces  of 

all  eigenvectors  of  A  and  Afc  respectively  corresponding  to  X.  Show  that  there  exists 
bases  x\...,xm  in  U1  and  in  Uj  such  that  (y^J^x^  =  S  , 

i , j  =  1 , . . . ,m. 

(Hint  Assume  first  that  A  is  in  the  Jordan  canonical  form.) 

(1.19.20)  Let 


Ax  =  Xx,  Aty  =  py ,  ACM  (C) ,  0  f  x,y  c  0° 
n 

Show  that  if  \  ?  u  then  yfcx  =  0. 

(1.19.21)  Verify  directly  that  J  annihilates  its  characteristic  polynomial.  Using  the 

fact  that  any  A  c  M  (F)  is  similar  to  its  Jordan  canonical  form  over  a  finite  extension 
n 

field  K.  deduce  the  Cayley-Hamilton  theorem. 

(1.19.22)  Let  A,B  c  M  (P)  show  that  A  *  B  if  ar.d  only  if  A  and  B  have  the  same 

n 

Jordan  canonical  form. 


i 


i 


J 


1 .20  Some  applications  of  Jordan  canonical  *orm. 

Definition  1.20.1.  Let  A  €  M  (F)  and  assume  that  splits  in  F.  Let  \n  *v»  a~ 

eigenvalue  of  A.  Then  the  number  or  factors  o*  the  *orm  (x-Xp)  appearing  in  the 
polynomial  iHxt  of_  A  Is  called  the  index  of  and  is  denoted  hy  index  (\ni.  "‘he 

number  of  linearly  Independent  eigenvectors  of  A  corresponding  to  X  is  called  t*e 
geometric  multiplicity  of  Xp . 

Using  the  results  of  the  previous  section  we  get 
Lemma  1.20.2.  Let  the  assumptions  of  Peeinition  1.20.1  hold.  ■nv.prt  index  (1.1  is  tve 
site  of  the  largest  Jordan  block  corresponding  to  Xp  (i.e.  of  the  *orm  Xpt*u\  and  the 
geometric  multlpl icltv  of  Xp  is  the  number  of  the  Jordan  hlochs  corresponding  to  X n 
which  appear  In  the  Jordan  canonical  form  o*  *. 

Let  T  e  I.(V) ,  Xp  e  o(t)  and  consider  invariant  suhspaces 

(1.20.3)  X  -  ( x I x  £  V,  (\  T-’T)rx  -  0),  r  -  0,1 . 

T  0 

Using  the  decomposition  (1.1Q.2)  and  the  definition  o*  dordan  *orm  o*  T  we  obtain 

Theorem  1.20.4.  Let  T  £  L(V)  and  assume  that  X  is  an  eigenvalue  e*  m.  Let  in*ex 

»  m  >  m  A,..»n  A  1  he  the  dimensions  of  all  doping  higgles  rorresppndigg  rc  \  w'-ir‘- 

12  a  -  o  - 

appear  in  Jordan  canonical  *orm  of  t,  Then 


(1.20.*;) 


dim  X 


i-1 


min!  r ,m^ ' , 


In  particular 


(1.20. A) 


""  *  yo  V  yi 


V  y2  S 


irvVx  (V^ 


-fl  *  - 


Also  note  in  view  of 


Thus  (1.20.6)  ciives  yet  another  character izat ion  of  the  index  of 
Definition  1.18.1  each  consists  of  generalized  eigenvectors  of  T. 

Definition  1.20.7.  An  operator  T  :*  L ( V )  is  said  to  have  a  simple  structure  if  there 
exists  a  basis  in  V  which  consists  entirely  of  eigenvectors  of  T.  That  is  any 
representation  matrix  A  jof  T  is  djagonahle  (i.e.,  similar  to  ^  diagonal  matrix). 

For  such  T  we  must  have  .  Theorem  1.19.1  yields. 

Theorem  1.20.6.  Let  T  t  L(V'.  Then  T  has  a  simple  structure  if  and  only  if  the  minimal 
polynomial  y  of  T  splits  to  linear  factors  such  that  any  two  factors  in  ^  are 
relatively  prime.  That  is  the  index  of  any  eigenvalue  of  T  equals  to  1 • 

Definition  1.20.9.  Let  T  i  Hon(M,M)  where  M  is  a  p-module.  T  is  called  nilpotent 
if  Ts  =  0  for  some  positive  integer  s. 

We  need  in  the  sequel  the  following  result. 

Theorem  1.20.10.  Let  T  c  L(V)  be  njlpotent.  Assume  that  U  is  a  non-trivial  invariant 
subspace  of  T  such  that 


(1.20.11) 


TV  -  U 


Then  it  is  possible  to  split  V  to  the  direct  sum  of  invariant  subspaces  (1.19.2)  and  to 

i  1  iri 

choose  in  each  invariant  subspace  basis  y  ,...,y  satisfying  (1.19.3)  -  (1.19.4) 

i 1  iri* 

(with  \p  =  0 )  such  that  the  vectors  y  , ...  ,y  ,  r^-1  *  r^  *  r^ ,  i  *  form  a 

basis  in  U. 

Proof .  The  proof  of  the  theorem  is  a  modification  of  the  proof  of  Theorem  1.19.1  and  we 
point  out  the  changes  one  needs  to  make.  Let  x1^1,  i  «*  1 ,  •  •  • , s-r+ 1 ,  j  =  mr_j+1 ,  •  •  •  ,nr ,  r  - 

1 . q  be  a  basis  of  V  satisfying  (1.10.6).  Then  the  condition  (1.20.11)  implies 

x1"7  •:  U,  i  '  1,...#s-r,  i  =  ^♦1#*..,m  ,  r  -  1#...,s-1.  We  choose  now  the  vectors 

x1^  in  the  following  way.  The  vectors  x  ,  ...,x  are  picked  up  to  satisfy  in  addition 

7'*'  -  (x11,...^  *1,  ■'  s  '  s  n^  ,  (n^*  -  0  if  TS  a  [01) 
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r 


We  claim  that  the  vectors  x1^  can  be  chosen  in  each  stage  to  satisfy  also 


(1.20. 12) 


TS“PU  =  [x1 1 , 
1(m  +1) 

p-2 


1  m 


1  (p~1)1 

t ♦ • . f  X  /.../X 


pn; 

/  I  •  •  •  /  X  | 


1mp-1  2(mp-2+1) 

••/X  ,  x 


2(m  +n'  ) 

P-2  p-1 


l(m  +n*  ) 
„  P"1  P 


]  , 


1  ( UL  |+1  )  e-n 

where  0  <  n '  <  n  .  (If  n'  =  0  then  x  **  /  T  PU).  Suppose  we  already  proved 


P  P  P 

the  claim  for  q  =  p-1.  Let 


TS-pU  =  (x11 . x1"1 . x(p-ln, 

1(Bp-2+1)  ’Vi  2(V2+1> 

X  r  ,  X  , 


(p-1 )m 


\  xP\ 


pn; 


2(m  +n‘  ,) 

p-2  p- 1  1 

<x  .  y  , 


.yfc)  • 


As  in  the  proof  of  Theorem  1.19.1  we  may  assume  that  Ty3  »  0,  j  ■=  1,...,t.  To  finish  the 
proof  we  have  to  show  that  the  vectors  x^,  i  =  1,...,p-r+1,  j  =  mj._1  +  1 , . . .  ,mr , 
r  =  1,...,p-1,  y1,...,yt  are  linearly  independent.  Suppose  that  these  vectors  are 


linearly  dependent.  Applying  T  to  the  vector  in  question  and  using  the  fact  that  x1^ 

.  lm,  1(m.+1)  1m,  lin,  , 

are  linearly  independent  we  deduce  that  x,.,.,x  ,  x  '  ,...,x  pi, 

y\***#y^  have  to  be  linearly  independent.  This  is  impossible  since  these  vectors  are 


part  of  the  basis  vectors  for  Ts-pU.  So  t  <  n  and  we  can  choose  vectors 

Km  ,  +  1  Km  ,+n  )  P  i 

x  P  /.../X  P  P  such  that  the  first  n'  vectors  coincide  with  y y+.  The 


equality  (1.20.12)  establishes  the  theorem. 


MICROCOPY  RESOLUTION  TEST  CHART 

NAIIONAl  BUREAU  OF  SUNDARDS  1%3  A 


1.21  The  equation  AX  -  XB  »  0 . 

Let  A,B  e  M  (D).  A  possible  way  to  determine  whether  A  *  B  over  M„(D)  is  to 
n  n 

consider  the  matrix  equation 

(1.21.1)  AX  -  XB  =  0  . 

Then  A  *»  B  if  and  only  if  there  exists  a  solution  X  such  that  |X|  is  an  invertible 
element.  If  we  consider  X  as  a  column  vector  X  composed  of  n  columns  of  X  then  the 
equation  (1.21.1)  has  a  simple  form  in  the  tensor  notation  (e.q.  Marcus-Minc  [1964],  see 
also  Problem  1.21.17) 

t  * 

(1.21.2)  (I  ft  A  -  B  a  I)X  =  0  . 

Thus  is  D  is  a  Bezout  domain  then  the  set  of  all  X  Mn(D)  satisfying  (1.21.1) 
form  a  D-module  with  a  basis  X^-.-.X^,  (Theorem  1.11.12).  So  any  matrix  X  which 
satisfies  (1.21.1)  is  of  the  form 

v 

x  -  I  x.X  ,  x  e  D,  i  -  1,...,v  . 

i-1  1 

Thus  it  is  "left"  to  find  whether  a  function 

v 

<5(x  ,..e,X  )  -  |  l  x  X  | 
i«1 

has  an  invertible  value.  In  such  a  generality  this  is  a  difficult  problem.  A  more  modest 
task  is  to  find  the  value  v  and  to  determine  if  5(x^,...,xv)  vanish  identically.  For 
that  purpose  it  is  enough  to  assume  that  D  is  actually  a  field  F  (for  example  the 
quotient  field  of  D).  Also,  we  may  replace  F  by  a  finite  extension  K  in  which  the 
characteristic  polynomials  of  A  and  B  split  to  linear  factors.  Finally,  we  are  ooino 
to  study  slightly  more  general  case 
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A  e  M  (K),  B  e  M  (K),  X  e  M  (K)  . 

in  n  mn 

Let  i|i(x)  end  ys( x)  and  J  and  K  be  the  minimal  polynomials  and  Jordan  canonical 
forms  of  A  and  B  respectively . 


4><x)  -  ( x-X t )  ...(x-X^)  ,  Xi  ¥  Xj  for  i  f  j  , 

fc1  fck 

<f{x)  -  (x-Uj)  . » •  ( x~Mj{ )  '  ui  ^  for  1  *  j  . 


(1.21.3) 


1  * 

p"  AP  -  J  -  l  •  J  ,  J  -  l  «  (X  Km  )  ♦  H(m  )),  1<m  <...<m  «  s  , 

J,«1  *  1  j.J  1  *J  11  1 

1  k  Pi 

0  Bfl  »  X  -  l  •  Kt,  K±  -  l  •  +  H(ni;j)),  1<nlp  c...^  •  t±  . 


y  -  p  'xq  . 

Then  the  system  (1.21.1)  is  equivalent  to 

JY  -  YK  «  0  . 

We  partition  Y  conformally  to  the  partitions  of  J  and  K  as  given  in  (1.21.3).  So 


r  r3 

(*..)»  Y  e  M  (X),  m  -  l  m  ,n  -  l  n  ,  i-1,...,i  ,  j-1,...,£ 
lj  ij  mtn  i  j.,  lr  )  r.,  Jr 


Thus,  the  matrix  aquation  for  Y  reduces  to  Ik  matrix  equations 


(1.21.4) 


Vij  -  YijKj  ■  °'  1  ■  1 . *'j  -  1 . k 


The  following  two  lemma  analyse  the  equations  (1.21.4). 

Lemma  1.21.5.  Consider  a  matrix  equation  1.21.4  for  some  choice  of  1  <  i  <  l  and 
1  <  j  <  k.  If  \i  f  then  Yi3  •  0. 

Proof.  Put 

qi 

Ji  “  X1  1<mi>  +  V  ^1  ”  ^  *  H<n,ir)  ' 

r*  1 


X  -  JI  I(n  )  +  X  .  X  -  l  •  H(n  )  . 

J  J  J  J  J  ir 


Then  (1.21.4)  becomes 


—  u  —  v 

Note  that  J.  =  K.  =0  for  u  >  m,  and  v  >  n; . 

i  3  i  j 


<xi  -  VT«  "  +  YijKj 


Thus 


(X  -(j  )  Y.  .  =  -J.(X  -U.)Y.  .  +  (X  -p.)Y.  .K. 
i  j  i]  i  i  3  iD  i  j  i3  3 

‘  ""I'  “JiYii  +  W  +  "Vi;  +  Vi".  ’ 
■  '“VSi +  2(-VYi3*i +  Vj2  • 


Continuing  this  procedure  we  get 


(X 


.VV  J, ‘S'-vVT  • 


Whence,  for  r  =  m^+n^  either  or  K ^  u  is  a  zero  matrix.  Since  X^  f 


we  deduce 


that  Y^ j  =  o. 


Lemma  1.21.6.  Let  Z  =  (z^)  e  M^tF)  satisfy  the  equation 


(1.21.7) 


H(m)Z  -  Z  H(n)  . 


Then  the  entries  of  Z  are  of  the  form 


z  =  0  for  6  <  u+n  -  min(m,n)  , 

up 

z  *»  z(u+1)(B+1)  for  6  *  a+n  -  min(m,n)  . 

up 


In  particular  the  subspace  of  all  m  x  n  matrices  Z  satisfying  (1.21.7)  has  the 
dimension  min(m,n). 

Proof .  As  the  last  row  of  H(m)  and  the  first  column  of  H(n)  are  equal  to  zero  row  and 
column  respectively,  in  view  of  the  equality  (1.21.7)  the  last  row  of  ZH(n)  and  the  first 


column  of  H(m)X  vanish.  That  Is  1,...,n-1,  g  «  ?,..., in.  Tn  all 

wp  ft  ? 

other  cases  hy  eguatino  the  (0,6)  entries  of  H(m)7  and  XH(n)  we  get  *  « 

(o+1)6 

zo(6-1)'  a  "  1 « •  •  •  f1""1 1  6  “  2,...,n.  The  above  set  of  eoualitles  imply  the  condition 

(1.21.8) . 

Combine  the  two  lemmas  to  obtain 

Theorem  1.21.0.  Consider  the  system  of  matrix  equations  (1.21.4).  Then  Vj ^  •  n 

¥  Uj  (i.e.,  Ji  and  Kj  do  not  have  a  common  eigenvalue).  Assume  that  Xj  » 
Partition  Y^j  conformally  with  the  partition  of  and  as  given  in  (1.21.3). 


Yii  "  »T'-  Y11V>  6  *m  n  (r)'  U  “  ’'""V  v’  ’* 
ij  ij  13  mluniv  1 


"i  * 


Then  each  is  of  the  form  prescribed  in  Lemma  1.21.8  with  m  -  mJu  and  n  -  n^„. 

Assume  that 


(1.21.10)  X^  *  |l1,...,Xt  *  Ut,  Xj  ¥  ,  i  «*  t+1,...,l,  1  »  t+1,...,V  . 

Then  the  dimension  of  the  linear  subspace  Mnn(lC)  of  matrices  v  «  (v^). 


i  «  1,...,i,  j  ”  1,...,V  satisfying  (1.21. 4)  is  given  hy  the  formula 


t  ai»Di 

(1.21.11)  dim  V  «  y  J  min(m  ,n.  )  . 

..  iv 


Let  us  consider  a  special  case  of  (1.21.1) 


(1.21.12)  AX  -  XA  «  0  ,  A  €  M  (O)  . 

T1 


In  that  ease,  a  D-module  o+  all  matrices  X  e  M  (n)  satisfying  (1. si. I'M  is  in  fact  s  ring 
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( non-eommutative  in  general)  with  an  identity  T.  Denote 


In  case  that  D  is  a  field  F  or  more  generally  when  C(A)  has  a  finite  basis  (according 


to  Theorem  1.11.12  this  assumption  holds  if  D  is  a  Bezout  domain)  then  according  to 
Theorem  1.21.9 


dim  C(A) 


i  qi 

I  l 

i-1  u,v-1 


mintm  ,m  ) 

1U  IV 


(Clearly  the  dimension  of  C(A)  is  not  changed  if  we  let  X  e  M  (K)).  As  {m.  },  is  a 

n  iu  i 

decreasing  sequence  we  have 


l  min(m  ,n  ) 
iu  iv 

v»1 


Mi 

ua,  +  T  IU, 
iu  iv 

v-u+1 


So 

i  qi 

(1.21.14)  dim  C(A)  •  £  £  (2u-1)m 

i-1  u-1  1U 

Let  i1 (x) , . . . ,in< x)  be  the  invariant  polynomials  of  xI-A.  Use  ( 1 . 19. 15)-( 1 . 19. 16)  to 
deduce 

n 

(1.21.15)  dim  C( A)  -  V  (2u-1)  deg  i,  ..(x) 

(n-u+1) 

u»1 

With  the  help  of  the  above  formula  we  can  determine  when  any  commuting  matrix  with  A  is  a 
polynomial  in  A.  Clearly,  the  dimension  of  the  subsapce  spanned  by  the  powers  of  A  is 
equal  to  the  degree  of  the  minimal  polynomial  of  A. 

Corollary  1.21.16.  Let  A  e  Mn(F).  Then  each  commuting  matrix  with  A  can  be  expressed 


A  are  .Identical.  That  is,  A  is  similar  to  a  companion  matrix  C(p),  where 
p(x)  -  |xI-a|  . 

A  matrix  for  which  the  minimal  and  the  characteristic  polynomial  coincide  is  called 
nonderogatory,  otherwise  derogatory. 

Problems 

(1.21.17)  Let  li  :  H  <D)  *  M,  ..(D),  such  that 
nn  ( nn )  i 

a  *  a 

p(X)  «  X  ■  ( x^ ) ,  j  ■  1,...,mn,  x^  «  xik  for  j  -  (k-1 )m  +  i  , 

i  •  1,...,ra,  k  m  i,...,n 


Prove 


(1.21.18)  ji(AX)  -  ( I ( n)  ft  A)V(X),  p(XB)  -  (Bt  ft  Km)u(X),  A  E  M  (D).  B  e  M  (D)  . 

in  n 

Here  A  ft  B  is  the  Kronecker  product 

(1.21. 19 ) A  ft  B  -  (a.  B)  e  M,  ,,  ,(D),  A  -  (a  )  e  m  (D),  B  -  (b.  .)  e  M  (D)  . 

ij  (mpHnq)  ij  mn  ki  pq 

Prove 


(1.21.20)  (A,  ft  A^B,  ft  B2)  -  (*,8,)  ft  (A2B2),  At  E  M^tD).  Bi  E  "^(D), 


i  -  1,2  . 


(1.21.21)  Let  P  E  M  (P),  C  E  M  (F),  R  E  M  (F).  Put 
in  n  inn 


A  -  <*  *),  B  -  (*  °)f«  (F) 

0  c  0  Q  mtn 


Assume  that  the  characteristic  polynomials  of  P  and  Q  are  coprime.  Show  that  there 

exists  a  matrix  X  ■  I1!™*  T,  ,)  which  satisfies  (1.21.1).  This  in  particular  implies 
o  nn) 

A  B. 
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(1.21.22)  Let  A  *  y  •  A.  e  M  <r>.  Prove  that 

L  in 

i=i 


(1.21.23) 


dim  C( A)  >  l  dim  C(A. ) 

i-1  1 


and  the  equality  sign  holds  if  and  only  if 


( |xl-A1l ,  I xI-Aj | )  =  1  for  i  y  j,  i.j  -  1 . i  . 


(1.21.24)  Let  ACM  (D).  Show  that  the  ring  C(A)  is  a  commutative  ring  if  and  only 
n 

if  A  satisfies  the  conditions  of  Corollary  1.21.16  where  P  is  the  quotient  field  of  D. 


(1.21.25)  Let  ACM  (D).  Let  Be  C(A).  Then  B  is  an  invertible  element  in  the  ring 

n 

C ( A)  if  and  only  if  B  is  a  unlmodular  matrix. 

(1.21.26)  Let 

(1.21.27)  C(A,B)  ■»  {X|X  C  M  (D)  ,  AX-XB  »  0}  A  C  M  (D) ,  B  C  M  (D)  . 

mn  nt  n 

Show  that  C(A,B)  is  a  left  (right)  module  of  C(A)  (C(B))  under  the  matrix 
multiplications. 

(1.21.26)  Let  A,B  c  M  (D).  Prove  that  A  a  B  if  and  only  if  (i)  C(A,B)  is  a  C(A)- 

n 

module  with  a  basis  containing  one  element  Vr  (ii)  any  element  basis  U  is  a  unimodular 


1.22  A  criterion  for  similarity  of  two  matrices. 

Definition  1.22.1.  Let  A  e  M  (D)  and  B  t  M  (D).  Denote  bv  r(A.B)  and  v(A.B)  the 

1 1  "  11  111  ifl  n  . .  ■  ■ 

rank  and  the  nullify  of  the  matrix  I(n)  a  A  -  Bt  •  l(m)  viewed  aa  a  matrix  over  M^CF) 

where  F  ia  the  quotient  field  of  D. 

According  to  Theorem  1.21.9  we  have 


(1.22.2) 


Theorem  1.22.3. 


t  Vpi 

v(A,B)  -  l  l  min(m  ,n  )  , 

i"1  u,v~1 

t  Vpi 

r ( A,B)  -  mn  -  £  £  minim  ,n  ) 

i-1  u,v-1  lu  iv 


Let  A  e  H  (D) ,  B  e  M  (D) .  Then 
m  n 


V(A,B)  <  j(v(A,A>  +  V (B,B) ]  . 


The  equality  sign  holda  if  and  only  if  m  «  n  and  A  and  B  are  einilar  over  the 
quotient  field  F. 

Proof,  without  lose  of  generality  we  may  assume  that  D  -  F  and  the  characteristic 
polynomials  of  A  and  B  splits  in  F[x) .  For  x,y  e  R  consider  the  function  min(x.y) 

(the  minimum  of  the  values  x  and  y).  Clearly,  min(x,y)  is  a  homogeneous  concave 

2  X+U  y+v  1 

function  on  R  min(ax,ay)  »  a  min(x,y),  a  >  0,  rain(-j— ,  *^— )  >  jfmlnlx.y)  min(u,v)J  . 

So 


(1.22.4)  min(a+b,c+d)  >  j[min(a,c)  +  min(b,d)  +  min(a,d)  +  mln(b,c)]  . 

Moreover,  a  straightforward  calculation  shows  that  if  a  *■  c  and  b  «  d  the  equality  sign 
holds  if  and  only  if  a  «  b.  Let 


N  *  max(m,n) ,  m 


Jv 


0,  for  <  u  <  N,  pA  <  v  <  H,  i 

j 


1, ...,£, 

1 , . . . ,k  . 
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"T  T 


i  ,  N  k ,  N  t ,  N 

v(A,A)  +  v(B,B)  =  j  (2u-1)m.  +  j  (2u-1)n.  >  j  (2u-1)(m.  ♦  n .  )  , 

.  L  .  iu  .  L  .  DU  .  .  xu  xu 

1,U=1  j,U“1  i,U=1 

and  the  equality  sign  holds  if  and  only  if  i  =  k  =  t.  Next  consider  the  inequality 

t,N  t  N 

I  (2u-1)(m.  +  n  )  -  J  y  mintm.  +  n.  ,  m.  +  n.  )  > 

.  .  iu  iu  .  .  *•  .  xu  iu  iv  iv 

i,u=1  i=1  u,v=1 

1  t  N 

—  J  y  min(m.  )  +  min(n,  ,n,  )  +  min(m.  ,n.  )  +  mintn,  ,m.  ) 

2  iu  iv  iu  iv  iu  iv  iu  xv 

,  t,N  t  qi'Pi 

=-  l  (2u-1)(«iu  ♦  niu)  ♦  J  l  min(m.u,niv)  . 
i,u“1  1- 1  u,v“1 

By  looking  at  the  terms  where  u  -  v  from  the  equality  case  in  (1.22.4)  we  deduce  that  the 

equality  sign  in  the  above  inequality  holds  if  and  only  if  m^u  -  niU<  u  “  1  «•••,!», 

i  ”  1 , . . . ,t.  The  above  inequality  is  equivalent  to 

.  t,N  t  VP1 

-  I  (2u-1)(miu  +  niu)  >  l  l  min(m<i(,n<w)  . 


i»1  u,v«i 


Combining  all  these  results  we  obtain  the  inequality  (1.22.3).  The  equality  sign  in 
(1.22.3)  holds  if  and  only  if  A  and  B  have  the  same  Jordan  canonical  forms.  That  is 
m  =  n  and  A  is  similar  to  B  over  F. 

□ 

Suppose  that  A  B.  That  is  (1.15.2)  holds.  Then  the  rules  for  the  tensor  products 
(Problem  1.21.20)  imply 


itA-Btai  =  ((O*)"1  a  i)  (i  a  a  -  a1  a  Dtc1  *  D  , 


n  !  -  b*  i  i  *  (ic1  i  g](i  s  s  -  «  me1  i  o"1) 


That  it  is  the  three  matrices 
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(1.22.6) 


I  •  A  -  A*  •  I,  I  •  A  -  Bfc  •  I,  I  •  !  ■  !(  I  I 


are  similar.  In  particular  these  matrices  are  equivalent.  Over  a  field  r  the  above 
matrices  are  equivalent  if  and  only  if  they  have  the  same  nullity.  Hence  Theorem  1.22.3 
yields. 

Theorem  1.22.7.  Let  A, 8  e  IMF) .  Then  A  and  B  are  similar  if  and  only  if  the  three 
matrices  in  (1.22.6)  are  equivalent. 

The  obvious  part  of  Theorem  1.22.7  extends  trivially  to  any  integral  domain  D. 

Lemma  1.22.8.  Let  A,B  c  M^fD).  _If  A  and  B  are  similar  over  D  then  the  three 
matrices  in  (1.22.6)  are  equivalent  over  D. 

However,  this  condition  is  not  sufficient  for  the  similarity  of  A  and  B  even  in 
case  D  «  Z.  (See  Problem  1.22.16.)  The  disadvantage  of  the  similarity  criterion  stated 
in  Theorem  1.22.7  is  due  to  the  appearance  of  the  matrix  I  0  A  -  Bt  »  I  which  depends 
on  A  and  B.  It  is  interesting  to  note  that  the  equivalence  of  just  two  matrices  in 
(1.22.6)  does  not  imply  the  similarity  of  A  and  B.  Indeed  IBA-AtBI» 

IS  (A  ♦  Xl)  -  (A  +  Xl)t  •  I  for  any  X,  but  A  is  not  similar  to  A  +  XI  for  X  t  0. 
(Problem  1.22.17.)  Also  if  A  ■  H(n)  and  B  «  0  then  v(A,A)  •  v(A,B)  “  n  (Problem 
1.22.18).  However,  under  certain  assumptions  the  equality  v(A,A)  •  v(A,B)  implies  that 


Theorem  1.22.9.  Let  A  e  fMC) .  Then  there  exists  a  neighborhood  of  A  »  (a^j) 


(1.22.10)  D( A,p)  -  (B|B  -  <b  )  C  M  (C) ,  l  |b  -a  |2  <  p2] ,  0  <  p  , 

3  i,j«1  3  3 


where  p  depends  on  A  such  that  if 


(1.22.11 ) 


then  B  is  similar  to  A. 


V(A,B)  -  V(A,A),  B  e  D( A,p )  , 


Proof.  Let  r  he  the  rank  of  I  S  A  -  Afc  B  I.  So  there  exists  indices 


““  . (“lr'a2rn'  B  =  ‘ ( 0 1 1 '  B2 1  > . (B1r'82r,)  ^  N  *  N' 


SJ  =  {1, . . .  ,n} 


such  that  |(I  B  A  -  Afc  8  I)[a|Bl|  *  0.  Also  |(I  8  A  -  A*  B  I ) 1 1 Y 1 6 ]  =  0  for 

2 

y,S  £  Q  .  Here  we  identify  the  sets  SI  x  SI  and  {1,...,n  }.  First  choose  a 

(r+1) ,n2 

positive  o  such  that 


(1.22.12) 


|(I  fi  A  -  B  B  I ) [a | 8] |  ?  0,  B  E  D(A,p)  . 


Next  consider  the  system  of  r  equations  in  variables  i,j,...,n,  out  of  n 

equations  of  (1.21.11)  (X  =  (x..,))  which  correspond  to  set  a 


(1.22.13)  J/Vkj  -  XikV  "  1  *  °1p'  j  “  °2p'  U  *  1 . . 


(1.22.14)  xk^  *  «k;.  for  (k,j)  ¥  <S1(j,82)i),  P  =  1,...,r 


In  view  of  (1.22.12)  the  system  ( 1 . 22 . 13 )-( 1 . 22 . 14 )  has  a  unique  solution  for  B  £  D(A,p). 
Also  X(A)  =  I.  Using  the  continuity  argument  we  deduce  the  existence  of  a  small  positive 
p  such  that  ) X (B )  1  /  0  for  B  t  D(A,p).  We  choose  such  p.  Let  1/  be  the  set  of 
matrices  B  which  satisfy 


(1.22.15) 


1(1  B  A  -  B  B  I)[y|S)|  =  0,  r,6  £  Q 


( r+1 )  ,n 


Thus  V  is  an  algebraic  variety.  We  claim  that  l /  n  D(A,p)  is  exactly  the  set  of 
matrices  of  the  form  (1.22.11).  Indeed,  let  B  £  1/  n  D(A,p).  Then  according  to  (1.22.15) 
v( A, B )  <  r.  On  the  other  hand  (1.22.12)  implies  that  r(A,B)  >  r.  These  inequalities 
yield  (1.22.11).  Assume  that  B  satisfies  (1.22.11).  So  (1.22.15)  holds.  Whence 
B  e  (/  n  D(A,p).  Finally,  in  view  of  (1.22.15)  we  deduce  that  for  B  e  V  n  D(A,p)  the 
equalities  (1.22.13)  imply  AX(B)  -  X(B)B  -  0. 

As  | X ( B)  )  ¥  0  we  get  that  A  *  B. 

□ 

Problems 

(1.22.16)  Show  that  for  A  and  B  given  in  Problem  1.15.8  the  three  matrices  in  (1.22.6) 
are  equivalent  over  Z  however  A  and  B  are  not  similar  over  Z  (see  Problem  1.15.8). 

(1.22.17)  Show  that  for  ACM  (F),  A  *  A  +  XI  if  and  only  if  X  «  0.  (Compare  the 

n 

traces  of  A  and  A  +  XI.) 

(1.22.18)  Show  that  if  A  «  H(n)  and  B  »  0  then  v(a,a)  -  v(a,b)  -  n. 

(1.22.19)  Let  a,b  e  m  (d) .  Assume  that  the  three  matrices  in  (1.22.6)  are  equivalent. 

n 

Let  I  be  a  maximal  ideal  in  D.  Let  P  “  D/I  and  we  may  view  A,B  as  matrices  over 
F.  Prove  that  A  and  B  are  similar  over  F.  (Note  that  the  matrices  in  (1.22.6)  are 
equivalent  over  F. ) 
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1.23  The  equation  AX  -  XB  =  C. 

A  related  equation  to  (1.21.1)  is  a  non-homogeneous  equation 


(1.23.1)  AX  -  XB  =  C,  ACM  (F)  ,  B  e  M  (F),  C  £  M  (F) 

m  n  mn 


which  is  written  in  the  tensor  notation  as 


t  *  A 

(1.23.2)  ( I ( n)  a  A  -  B  a  I(m))X  =  C  . 


A  necessary  and  sufficient  condition  for  the  solvability  of  (1.23.2)  can  be  stated  as 
follows.  Consider  a  homogeneous  system  whose  coefficient  is  the  transposed  coefficient 
matrix  of  (1.23.2)  (see  Problem  1.23.9) 

( I ( n)  a  Afc  -  B  S  I(m) )Y  =  0  . 

Then  (1.23.2)  is  solvable  if  and  only  if  any  solution  Y  is  orthogonal  to  C  (e.g. 
Problem  1.23.10).  In  matrix  the  above  equality  is  equivalent  to 

AfcY  -  YB*  -  0  . 

The  orthogonality  of  Y  and  C  is  written  as  trtY^)  »  0  (see  Problem  1.23.11). 

Thus  we  proved 

Theorem  1.23.3.  Let  A  e  fWF)  ,  B  c  M^F)  .  Then  (1.23.1)  is  solvable  if  and  only  if 

(1.23.4)  tr(ZC)  =  0 

for  all  Z  e  M  (F)  satisfying 

(  1.23.  ■>)  ZA  -  BZ  =  0  . 
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Using  Theorem  1.23.3  we  can  obtain  a  stronger  version  of  Problem  1.21.22 


Theorem  1.23.6.  Let  G 

Then 


(G^),  G  e  M  (F),  G  «  0  for  j  <  i,  i,j  -  1 
■•■j  ij  n.n.  13  - 


t. 


I 

(1.23.7)  dim  C(G)  >  J  dim  C(G  )  . 

i-1  11 


Proof.  Consider  first  the  case  1  ■  2.  Let  G  »  .  Consider  a  matrix  T 

which  commutes  with  G.  So 


(1.23.8)  AU  -  UA,  BV  -  VB,  AX  -  XB  -  UE  -  EV  . 

According  to  Theorem  1.23.3  the  matrices  U  c  C(A)  and  v  e  C(B)  satisfy  the  last 
equation  of  (1.23.8)  if  and  only  if  tr[Z(CJE-EV)]  for  all  Z  satisfying  (1.23.5).  Thus 
the  dimension  of  the  subspace  of  pairs  (U,V)  satisfying  (1.23.8)  is  at  least 

dim  C( A)  +  dim  C(B)  -  dim  C(B,A)  . 

On  the  other  hand  if  U  «  V  ■  0  then  the  dimension  of  the  subspace  of  matrices  X 
satisfying  (1.23.8)  is  dim  C(A,B).  The  equality  (1.21.11)  implies  dim  C(A,B)  • 
dim  C(B,A).  Hence  we  established  (1.23.7)  for  i  -  2.  The  general  case  immediately 
follows  by  induction  on  4 . 

We  remark  that  contrary  to  the  results  given  in  Problem  1.21.22  the  equality  in 
(1.23.7)  may  occur  even  if  G^  «  G^  for  some  i  >•  j.  See  Problem  (1.23.12). 

Problems 

(1.23.9)  Let  ABB  be  defined  as  in  (1.21.19).  Prove  (A  B  B)t  »  At  ■  Bt. 

(1.23.10)  Consider  a  system 


Ax  =  b,  A  E  M  (F),  b  E  M  .(F) 
mn  n  1 

Prove  that  the  above  system  is  solvable  if  and  only  if  y*b  »  0  where  y  is  a  solution  of 
the  system  Afcy  =  0.  (Hint;  Change  varibles  to  obtain  A  in  its  diagonal  form  as  in 
Section  1.12). 

(1.23.11)  Let  X , Y  s  M  (D).  Let  w(X),p(Y)  E  M,  , . (D)  be  defined  as  in  Problem 
1.21.17.  Prove  that 


W(Y)tu(X)  =  tr(YfcX)  . 

(1.21.12)  Assume  in  Theorem  1.23.6  1  »  2,  *  G22  *  0,  G12  *  I.  Show  that  in  this  case 

tne  equality  sign  holds  in  (1.23.7). 

(1.23.13)  Let  ACM  (F) ,  i  -  1,2  and  suppose  that  A.  and  A,  do  not  have  a  common 

i  *  * 

eigenvalue.  Assume  that  A  -  A.,  »  Aj.  Let  C  -  ( ^ ) ,  X  »  (XA  j ),  CjjfXj^  e  Mn  n  / 
i,j  -  1,2.  Using  Problem  1.21.22  prove  that  the  equation  AX  -  XA  -  C  is  solvable  if  and 


only  if  the  equations  AiXii 


XiiAi  “  Cii' 


i  -  1,2. 


The  following  result  is  needed  later 


Theorem  1.24.1.  Let  A  e  Mn(r}  be  a  nllpotent  matrix.  Put 

Xk  »  {x|x  £  Pn,  Akx  -  0},  k  »  0,1,...,  . 

Assume  that 

0  -  X0  ?Xi  ^X2  ^...^  Xp  -  F"  . 

Suppose  that  B  e  fMP)  satisfies 

(1.24.1)  BXi+1  <=  X^  i  ■  0 ,  • . .  ,p-1  . 

Then 

(1.24.3)  v(A,A)  <  v(B,B>  . 

The  equality  sign  holds  if  and  only  if  B  is  similar  to  A. 

Proof .  we  prove  the  lemma  by  induction  on  p.  For  p  »  1  the  theorem  is  trivial  since 
A  “  B  •  0.  Suppose  that  the  lemma  holdB  for  p  ■  q  -  1  and  we  prove  it  for  p  »  q.  A1 
and  B1  be  the  restrictions  of  A  and  B  to  Xq*v  Assume  that  A^.A  and  ,B  have 


the  following  Jordan  canonical  forms 


f 


i 

■! 


h 


i 


« 


piV 


u'  -  M 

-  I  0  H(m! ) ,  P*  AP  «  l  t  H(m  )  , 

1  i=1  1  i-1  1 


V  ' 

=  I  ®  H(n'),  C_1BC  =  l  9  Kfr^)  , 
i=1 


i  =  1 


(1.24.4) 


U  * 


>  m^>. . .>m^ ,  >  1,  n’  >  n*>...>n^,  <1,  £  mj  =  £  = 

u  y 

m,  >  m  >...>n  >  1,  n.  >  n  >...»n  >1,  J  m.  =  )  n 

1  2  u  1  2  v  *•  i  *•  i 


dim  X 


q-1  * 


i*1  i=1 


dim  X  =  n 

q 


Clearly  the  Jordan  canonical  form  of  Af  and  A  is  determined  completely  by  the 
dimensions  of  the  subspaces  X^,...,X  Indeed,  put 


ij/i  »  dim  Xi(  6i  -  ti+1  "  i  -  0,1,... 


Then  -8  *s  the  num^er  t^,e  Jordan  blocks  of  order  i  in  the  Jordan  canonical 


form  of  A  and  Aj.  So 


(1.24.5)  u  -  p’,  mj  -  m  -  1,  i  «  1, 


Formula  (1.21.14)  yields 


■  ■  #  6  . ,  m)  =  m.  for  i  >  0  . , 

q-1  l  1  q-1 


0  ,  “  n  -  dim  X 

q-1  q-1 


W  U’  q-1 

(1.24.6)  v( A, A)  -  I  (2u-1)m  -  )  (2u-1)m’  +  T  (2u-1)  -  vtA’.A*)  +  0  .  . 

,  U  ,  U  ,  0-1 

U=  1  U=  1  U“  1 


According  to  Theorem  1.20.10  we  have  the  following  relations  between  the  Jordan  canonical 
form  of  B  and  B1 


w 
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I 


v  >  v',  n^  +  1  >  >  n^,  i 


1 


,v\  ni 


i  .  v-v'+1,...,v  (if  v  >  V). 


So 


v  v*  q-1 

(1.24.7)  v(B.B)  -  l  <2u-1)n  >  £  (2u-1)n  +  [  (2u-1)  -  vlB’.B’)  +  9 

i«1  1  1-1  1  u-1  q~ 


and  the  equality  sign  holds  if  and  only  if  n  -  n!  +  1,  i  -  1,...,9  .  Combine  (1.24.6) 

1  i  q*i 

with  (1.24.7)  to  deduce  (1.24.3)  by  induction.  Suppose  that  the  equality  sign  holds  in 
(1.24.3).  Since 

v(A',A')  -  v<B',B*),  n.  -  nj  ♦  1,  i  -  1 . 6  . 

1  l  q-i 

The  induction  assumptions  imply  that  A’  B'.  Use  (1.24.5)  and  the  above  equalities  to 
get  A  *  B. 

D 
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(1.25.17) 


indax  (A) 


max  index  (X) 

Xe o  (A) 

P 

Problems 

(1.25.18)  Let  A  e  M^fC) ,  i)i(x)  e  C[x]  be  the  minimal  polynomial  of  A.  Assume  that  f! 
is  an  open  set  in  C  such  that  o(A)  c  0.  Let  e  H(0).  Then  the  values 

(1.25.19)  *(k)(X),  k  -  0,1,...,  index  (X)  -  1,  X  e  oJA) 

u 

are  called  the  values  of  on  the  spectrum  of  A  (0(A)).  Two  functions,  ,6  are  said 
to  be  coinciding  on  o(A)  if  they  have  the  same  values  on  0(A).  Assume  that  <fi  e  Ctx] 
and  let 


<e(x)  -  u(x)iMx)  +  8(x),  deg  6  <  deg  iji  . 


Prove  that  8(x)  coincides  with  l(x)  on  o(A).  Let 


*(*) 

i|i(x) 


io(x) 


+  Mil 

<l<(x) 


u(x) 


i 

+  l 

i-1 


l 

3-1 


lil. 


(x-X^)J 


,  *  index  (X^),  i  •  1,,.,,i  , 


where  VCx)  in  given  by  (1.18.20).  Show  that  a^,  3  -  b^,  s^-1, ... ,8^-p,  are  determined 
recursively  by  ^^(X^),  3  “  0,...,p.  (Multiply  the  above  equality  by  i)i(x)  and  evaluate 
this  identity  and  its  derivatives  at  X^.)  Thus  for  any  <f  e  H(fl)  define  8(x)  by  the 
equality 


(1.25.20) 


8 ( x )  -  iMx) 


t 

I 

i-1 


"i 

I 

3-1 


(x-Xi)- 


The  polynomial  0(x)  is  called  the  La grange -Sylvester  (L-S)  interpolation  polynomial 
of  <f  (corresponding  to  V)*  Prove  that 
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(1.25.21) 


<f  (A)  -  6(A) 


I 


i-i 

f 


Let  8  Jx)  be  L-S  polynomials  of  <t ^  e  K(S1), 
with  L-S  polynomial  of  >P.|(x)^2(x)  on  <J(A>. 


j  =  1,2.  Show  that  9  (x)0  (x)  coincides 

j  i  12 

Use  this  fact  to  prove  the  identity 


(1.25.22)  V51(A)^2(A)  =  *><A),  *<x)  =  ^(x)^2(x)  . 


(1.25.23)  Prove  (1.25.22)  by  using  the  definition  (1.25.7)  and  the  relations  (1.25.9). 

(1.25.24)  Let  the  assumptions  f  Problem  1.25.18  prevail.  Assume  that  a  sequence 

d  e  H(fl),  converges  to  ^  e  H(fl).  That  is  £_(x),  m  <=  1,2,...,  converge  uniformly 
n  *” 

to  vMx)  on  any  compact  set  of  0.  This  in  particular  implies 


lim  •f 
m-*” 


<j) 

m 


(X)  ■  •f 


(j) 


(X), 


1 


0,1 


. ,  X  e  n  . 


Use  the  definition  (1.25.7)  to  show 


(1.25.25)  lim  *  (A)  =  v"(A)  . 

nv 

m+°° 

Apply  this  result  to  prove 


(1.25*26) 


00  m  N  m 

A  r  A  .  ,  .  r  A 

E  =  Ln  5T  (=  l  mP 

m=0  m=0 


(1.25.27) 


( X I  -A ) 


[  for  I X  |  >  p  ( A )  . 

m=*0  X 


1 .26  Cesaro  convergence  of  matrices 
Let 


0.26.1)  K  =  (a'  )  e  M  <C),  k  -  0,1,2,... 

K  1 3  itin 

be  a  sequence  of  matrices.  The  p-th  Cesaro  sequence  is  defined  as 


(1.26,2) 


\,0  ’  Vp 


,  (k,P) 
ij 


k 

I  A  ,/(k+1>,k 
j-0  j,P  1 


0,1 


Definition  1.26.3.  A  sequence  {A^}  Is  said  to  be  convergent  to  A 


(alj* 


if 


(k) 

lim  a...  »  a.  . ,  i  »  1 , . . .  ,m,  j  -  1 , . . .  ,n,  «■»  lim  A.  »  A  . 

k+«  13  13  k*« 


A  sequence  {a^}  is  said  to  be  p(>0!  -  Cesaro  convergent  to  A 


lim  A 
k+” 


k,p 


A 


and  exactly  p(>1)  -  Cesaro  convergent  if  in  addition  to  the  above  equality  the  sequence 
A^  p_i»  A  ”  0,1...,  is  not  convergent. 

It  is  a  standard  fact  ( e .  g .  Hardy  (1949))  that  if  { }  is  p-Cesaro  convergent  then 
{A^}  is  (p+1)  -  Cesaro  convergent.  A  standard  example  of  exactly  1  -  Cesaro  convergence 
sequence  is  {X  },  |Xf  «  1,  X  ?  1.  More  precisely  we  have  (e.g.  Hardy  [1949]  or  Problem 


1.26.11). 

K  k 

Lemma  1.26.4.  Let  |X|  «  1,  X  ?  1.  Then  the  sequence  (  ,  k  »  0,1,...,  is  exactly 

p-Cesaro  convergent  to  zero  for  p  >  1 . 

We  now  show  how  to  recover  the  component  Z  ,  for  0  y  X  e  o  (A)  by  usinq  the 

CMS  •  * )  a  p  7 

01  j 

notion  of  Cesaro  convergence.  ] 
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and  any  j  or  for  |A|  -  1  and  j  <  a  -1  this  sequence  converges  to  zero.  For 

a 

A  *  1  and  j  -  sq-1  the  'bove  sequence  converges  to  1.  For 

I  A. |  -  1,  A  ?  1  and  j  >  s  — 1  the  given  sequence  is  exactly  j  -  s  +2  Cesaro 
lx  ex  a 

convergent  to  zero  in  view  of  Lemma  1.26.4.  These  arguments  establish  the  theorem. 

C 

Corollary  1.26.9.  Let  the  assumptions  of  Theorem  1.26.5  hold.  Then 


(1.26.10) 


lim  (  l  (s-1)l  <p"1(A>A),'a8"1)/(N+1>  -  z,  s  -  index  (A) 
N+“>  k-0 


where  Z  «  0  unless  A1  -  P(A)  e  0(A)  and  index  (A^  -  index  (A)  in  which  case  Z 


21  ( s-1 ) * 


(1.26.11)  Let  1 A |  -  1,  A  j  1  be  fixed.  Using  the  formula 


h-1  .  A  , 

I 


prove  by  differentiating  the  above  equality  r  times 


I  <3)A3  -M[  a(A,r ,1)  (k'1)Ak"1]  +  (-1)r  X  , 
j-0  r  1-0  *  (A-1  )~J 


,  k  >  1 


where  <x(A,r,4)  are  some  fixed  non-zero  functions.  Use  the  induction  on  r  to  prove 
Lemma  1.26.4. 

(1.26.12)  Let  x)  be  a  normalized  polynomial  of  degree  p-1.  Prove  that  the  sequence 


{v>(k)A  },  |A|  -  1,  A  y  1,  is  exactly  p-Cesaro  convergent. 


(1.26.13)  Let  A  e  M  (C) .  Put 

n 

(1.26.14)  Z..(A)  =  (z'lj>),  y.v  =  1 , . . . ,n ,  X.  e  0J(A),  j  =  0,... .index  (X.)  -  1  . 

i;j  yv  id  i 


(1.26.15)  index  (X.)  =  1  +  max(j  :  z  J  ^  0,  j  =  0,..., index  (X.)  -  1}  ,  where 

yv  i  yv  l 

index  (X.)  =0  if  z*^'  =  0,  j  =  0,... .index  (X.)  -  1 
yv  i  yv  l 

(1.26.16)  p  (A)  =  max{|X.|  :  index  (X.)  >  0}  where  P  (A)  =  -*•  if 

yv  l  yv  l  yv 

index  (X.)  =  0  for  all  X.  e  o(A). 
yv  l  l 

(1.26.17) 

index  (A)  =  max{index  (X.)  :  index  (X  )  >  0,  |X.|  =  p  (A)} 
yv  yv  i  yv  i  i  yv 

Here  index  (A)  =  0  if  p  (A)  =  The  quantities  index  (X.), 

yv  yv  yv  i 

index  (A),  p  (A)  are  called  the  (y.v)  index  of  X.,  the  (y.v)  index  of  A  and  the 
y  v  yv  l 

(y.v)  spectral  radius  of  A  respectively.  Or  shortly  the  local  indices  of  Xi  and  A 
and  the  local  spectral  radius  of  A.  Show  that  Theorem  1.26.5  and  Corollary  1.16.9  could 
be  stated  in  a  local  form.  That  is  for  1  <  y.v  <  n  assume  that 

“  Pyv(A)'  sa  indexy/V'  **  "  '‘yv’’'  \  =  '“yv.k’'  \,p  =  <W  where  \  1 


given  by  (1.26.6)  and  A^  p  by  (1.26.2).  Prove 


(1.26.7)' 


(a(s  -D) 


lim  a  =  z 

k*<.  ^<kP 


p  «  index  (A)  -  index  (X  )  +  1  . 

r  yv  yv  a 


(1.26.10)' 


lim{  I  ( S-1 )  !  tp“’(A))  ka  /kS  VtN+1)  «  z  , 

k=0  1  ' 

s  »  index  (A),  p  (A)  >  0  , 

yv  yv 


where  z  =0  unless  X.  =  p  (A)  c  o(A)  and  index  (X.)  •  index  (A)  in  which  case 
yv  1  yy  yv  1  yv 

z  «  Z1'3*1'.  Finally  A  is  called  irreducible  if  p  (A)  «  p ( A )  for  y.v  =  1,...,n. 
yv  y  v  -  yv 

Thus  for  irreducible  A  the  local  and  the  global  versions  of  Theorem  1.26.5  and  Corollary 


1.26.9  coincide. 


Consider  an  iteration  given  by 


(1.27. 1)  xi+1  «  Ax1  +  b,  i  «  0,1,... 

where  A  t  M  (C) ,  x^,b  e  M  (C).  Such  an  iteration  can  be  used  to  solve  a  system 
n  n  i 

(1.27.2)  x  «  Ax  +  b 

Also,  the  iteration  (1.27.1)  appears  naturally  in  certain  physical  instances  where  a  given 
physical  system  evolves  discretely  in  time  according  to  (1.27.1)  (e.g.  Berman-Plemmons 
[1979].  Assume  that  x  is  the  only  solution  of  (1.27.2)  and  put  y1  •  x1  -  x.  Then 

(1.27.3)  yi+1  -  Ay1,  i  -  0,1,2,...  . 

We  would  like  to  know  under  what  conditions  lim  y1  -  0,  regardless  of  the  initial 

i-*» 

condition  y°.  This  is  equivalent  to  the  statement  that  x'1*  converges  to  x  for  any 
initial  condition  x* 0 ' .  In  some  other  instances  the  evolusion  of  a  certain  physical 
system  is  given  by  (1.27.3).  In  that  case  it  is  important  to  know  whether  the  system  would 
not  blow  up.  That  is  there  exists  a  constant  M(y°)  (depending  on  y°)  such  that 


(1.27.4)  ay1!!  <  M(y° ) ,  i  «  0,1,...  . 

Here  for  B  e  M  (C)  we  define  the  norm  of  B  -  IBI  -  as 
mn 

n 

(1.27.5)  1 B II  =  max  l  lb  I,  B  -  (h  ,)  e  M  (C)  . 

1<i<m  j-1  iJ  mn 

Definition  1.27.6.  The  system  (1.27.3)  is  called  stable  if  the  sequence  y1,  i  ■ 

0,1,...,  converges  to  zero  for  all  choices  of  y°.  The  system  (1.27.3)  is  called  bounded 


if  the  sequence  y1,  i  “  0,1,...,  1s  bounded  for  all  choices  of  y°(  (1,27.4)  holds. 

Clearly,  the  solution  to  (1.27.3)  is 

y1  “  aV,  i  -  0,1,...,  , 

so  (1.27.3)  is  stable  if  and  only  if 

(1.27.7)  lim  A1  =  0 

and  (1.27.3)  is  bounded  if  and  only  if 

(1.27.8)  II  A1  It  <  M,  i  =  0,1,..., 
for  some  positive  number  M. 

Theorem  1.27.9  Let  A  c  (C) .  Then  the  powers  A1,  i  «  0,1,...,  converge  to  aero  matrix 


(1.27.10) 


0(A)  <  1 


These  powers  are  bounded  if  and  only  if 

(1.27.11)  p ( A)  <  1,  and  index  (A)  =  1  if  p(A)  =  1  . 

Proof.  Consider  the  formula  (1.26.8).  Since  all  the  components  of  A  are  linearly 
independent  (1.27.7)  is  equivalent  to 


lim  <k)^-j  -  0,  X.  e  o(A),  j  =  0,1,...,  index  (X.)  -  1  . 

,  Ilia  l 

k-x»  J 


Of  course,  the  above  conditions  are  equivalent  to  (1.27.10).  The  condition  (1.27.8)  is 

,  is  bounded.  This 

immediately  yields  that  p(A)  <  1.  If  p(A)  <  1  then  (1.27.7)  holds  which  implies 


k  k- j 

equivalent  to  the  statement  that  the  sequence  (^)X^  »  It  “  0,1 


(1.27.8).  Suppose  that  p(A)  =  1  and  let  X  e  0  (A),  i.e.  |X  I  *  1.  Then  the  sequence 

ip  l 

k  k-i 

(^)X^  is  bounded  if  and  only  if  j  =  0.  That  is  we  must  have  index  (X^)  =  1.  This 
establishes  (1.27.11). 


Problems 

(1.27.12)  Let  ACM  (C)  and  i|Mx)  the  minimal  polynomial  of  A  is  qiven  by 
n 

(1.18.20).  Verify 


(1.27.13) 


At  1  Vl  ^ 

e  =  )  •  '■  z 

i=t  j-0  3  3 


Use  (1.25.9)  or  (1.25.26)  to  prove 


(1.27.14) 


d  At  At  At 

—  ( e  !  *  Ae  =  e  A  . 


Prove  that  the  system 


(1.27.15) 


-  Ax,  x(t)  e  M  (C) 
at  n  i 
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has  a  solution 


Wl-  r-T>  wr^-i'^jWs*..:  ,>j AOWffP'fKVjt 


(1.27.16) 


A(t-t0) 

x(t)  -  t  x( ip ) 


Th.  system  (1.27. IS)  is  called  stable  if  lim  x(t)  ■  0  for  any  solution  (1.27.16).  The 

t** 

system  (1.27.15)  is  called  bounded  if  any  solution  (1.27.16)  satisfies 
lx(t)l  <  M,  t  >  tfl,  M  -  M(x(t0)>  . 

Prove  that  (1.27. IS)  is  stable  if  and  only  if 


(1.27.17) 


Re{X}  <  0  for  X  e  o(A)  , 


and  (1.27. IS)  ia  bounded  if  and  only  if 


(1.27.18) 


Re{X}  <  0,  index  (X)  “  1  if  Re{X}  »  0  . 


,T  }  be  a  set  of  disjoint  simple  connected  rectifiable  curves  such  that  T 


6- 


la  a  boundary  of  an  open  domain  D,  and  r  u  D  c  (2.  Let  A  c  M^tC)  and  assume  that 
o(A)  c  0.  Then  for  any  ^(x)  analytic  In  fl  we  have 
(1.28.8)  *(A)  -  rV  /  R(  x,AK(x)dx  . 

'  j, 

Proof.  Insert  the  expression  (1.28.1)  into  the  above  integral  to  get 


1 

2vi 


/  R(x,A)V>(x)dx 

r 


i 

l 

k-1 


V1 

I 

j-0 


[27i  /  tx-^v)‘(j+1)^(x)dx)Z 


kj 


Now  use  the  identity  (1.28.6)  to  deduce 


1 

2wi 


/  R(x,A)l(x)dx 

r 


4  k 

I  I 

k-1  j-0 


(j) 


<V 


11 


kj 


The  definition  (1.25.7)  of  ( A)  yields  the  equality  (1.28.8). 


□ 


He  illustrate  the  usefulness  of  Cauchy  integral  formula  (1.28.8)  by  two  examples. 
Theorem  1.28.9.  Let  A  c  M^tc)  and  assume  that  \  e  o(A).  Let 


(1.28.10) 


X  ED,  X  D  o  T  for  X  t  o(A),  X  ?  X 
P  P 


Proof .  As  in  the  proof  of  Theorem  1.28.7 


i  V1 


/  R( x,  A)  ( x-X  )q  1  dx  =  2  1  / 

21,1  r  p  xi,  j-o  21,1  r 


( x-X  ) 


q-1 


(x-X,  )J 


dxlZ^  . 


( x-X  ) 


q-1 


Suppose  that  X,  f  X  .  Then 
k  p 


( x-X  ) • 


(x-X  ) 

is  analytic  in  r  p  D  c  !2.  Put  y*  (x)  =  - E_ 

(x-X,/ 

k 


and  use  (1.28,5)  to  deduce  that  the  corresponding 


integral  which  appears  in  the  above  equality  vanishes.  The  same  result  applies  for 

X.  ■  X  and  j  <  q-1.  For  j  >  q+1,  put  <£(x)  =  1  then 
k  p 


“  .  =  JZ  \ — T  Apply  (1.28.6)  to  deduce  also  that  the  corresponding  intearal 

(x-X  '  ^  (x‘V 

with  this  term  is  equal  to  zero.  Hence 


1 

2*i 


/ 

r 


R(x,A)(x-X  )”q+,dx 
P 


pq 


z 


pq' 


□ 

Our  next  example  generalize  the  first  part  of  Theorem  1.27.9  to  a  compact  set  of 
matrices. 

Definition  1.28.12.  A  set  A  c  Mn(C)  is  called  stable  if 

(1.28.13)  lim  sup  IAk«  =  0  . 

k-»*>  AeA 


Theorem  1.28.14.  Let  A  c  Mn(C)  be  a  compact  set.  Then  A  is  stable  if  and  only  if 
p  ( A)  <  1  for  any  A  c  A. 

To  prove  the  theorem  we  need  a  well  known  result  on  the  roots  of  normalized 
polynomials  in  C[x]  (e.g.  Ostrowski  (1966)). 


insider  a  normalized 


p(x)  ■  xm  +  I  a . xn  *  e  C[x].  Then  the  zeros  of  p(x)  are  continuous 

i-1  1  1  “ 

functions  of  its  m  coefficients.  That  vis  for  a  given  e  >  0  there  exists  6(e) 

depending  on  a1,...,an),  such  that  If  Ib^-a^l  <  6(e), 

m  _ 

1  “  it  Is  possible  to  enumerate  the  seros  of  q(x)  ■  xm  *  [  b  x”  b^ 

”  -  -  i-!  1 

,  where  In.  -  C, I  <  e,  i  »  In  particular  the  function 

i  m  "  l  1  ■ 1  i  ■ 

(1.28.16)  p(p)  -  max  |$  I 

1<i<m 

Is  a  continuous  function  of  a1,...,*n. 

Corollary  1.28.17.  For  A  e  M^tC)  the  function  p(A)  Is  a  continuous  function  of  A. 

That  is,  for  a  given  e  >  0  there  exists  5(e,A)  >  0  such  that 

|p(B)  -  p( A) |  <  e  for  IB-AI  <  6(e, A)  . 

Proof  of  Theorem  1.28.14.  Suppose  that  (1.28.13)  holds.  Then  by  Theorem  1.27.9  p(A)  <  1 

for  all  A  e  .  Assume  now  that  A  is  compact  and 

p( A)  <  1  for  A  e  A.  According  to  Corollary  1.28.17 

*  • 

p  _  max  p( A)  ■  p(A  )  <  1,  A  e  A  . 

At  A 

Recall  that  (xI-A)_1  •  (p^ j< x)/|xI-A| )  where  p^fx)  are  the  (i,j)  cofactors  of  the 
matrix  (xI-A).  So 

Ilxl-All  -  |n(x-Xi) I  >  [ | x I  -  p(A)]n  >  ( |x|  -  P)n,  for  | X I  >  p  . 
XjtoU) 

Let  p  <  p'  <  1.  Then  the  above  inequality  and  the  expression  of 
(xI-A)”1  in  terms  of  Its  cofactors  imply  that  »(xI-A)  %  <  k,  |x|  «  p', 
since  A  is  a  bounded  set.  Apply  (1.28.8)  to  get 
A P  «  /  (xI-A)  ^dx  . 


(1.28.18) 


1 


j 


-1  p  t>+ 1 

Combine  this  equality  and  the  estimate  Il(xl-A)  )  II  <  K  to  get  IA  I  <  K(p')  . 


As  p'  <  1  we  obtain  (1.28.13). 


A  generalization  of  the  second  part  of  Theorem  1.27.9  to  compact  sets  A  c  1^(0  is  a 
far  more  complicated  result  and  will  be  stated  in  the  next  chapter. 

Problems 


(1.28.19)  Let  ASM  (C).  Using  (1.28.1)  deduce  that 

n 

s  1  1 

(1.28.20)  Z.,  =  lim  (x-X.)  X(xl-A)  ,  i  -  1,...,t  . 

*(V1)  x*X.  1 


Put  R(x,A)  «  (r  (x)).  Using  the  definitions  of  Problem  (1.26.13)  show 

uv 


(1.28.21) 


z 


i( s-1 ) 

liu 


lim  (x-X . )  r  (x) ,  s 
x+X , 

l 


index  ( X . )  >  0 
MV  i 


(1.28.22)  A  set  A  £  Mn(C)  is  called  exponentially  stable  if 


(1.28.23) 


lim  sup  leAtH  =  0 
T*»  t>T,AeA 


Prove  that  a  compact  A  is  exponentially  stable  if  and  only  if  Re{Xj  <  0,  X  e  d(A)  for 
any  A. 

(1.28.24)  A  matrix  B  e  M  (C)  is  called  projection  (idempotent)  if 

n 

B  =  B.  Let  T  be  a  set  of  simply  connected  rectifiable  curves  such  that 
T  form  a  boundry  3d  of  an  open  domain  D.  Let  A  e  M^(C)  and  assume  that 
r  n  0(A)  =  0.  Put 

(1.28.25)  Pd(A)  -  ^  J  R( x ,  A ) dx  ,  A(D)  -  /  R(x,A)x  dx,  0D(A)  -  0(A)  n  D  . 
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Show  that  PD(A)  is  a  projection.  Pd(a)  is  called  the  projection  of  A  on  D  and 
A(D)  Is  called  the  restriction  of  A  to  D.  Prove 

(1.28.26)  VA)  ’  l  Zi0'  A(D>  ‘  l  tXiZiO  +  *n>  • 

X.eo_(A)  10 

ID  ID 

Show  that  the  rank  of  PD ( A)  is  equal  to  the  number  of  eigenvalues  in  D  counted  with 
their  multiplicities.  Prove  that  there  exists  a  neighborhood  of  A  such  that  PD( B) 
and  b( D )  are  analytic  functions  in  B  in  this  neighborhood.  In  particular,  if  D 
satisfies  the  assumptions  of  Theorem  1.28.9  then  pd(A)  is  called  the  projection  of  A  on 
Xp.  According  to 


(1.28.26) 

VA)  "  zPo- 

(1.28.7) 

Let  B  -  QAQ-1  e 

M  ( C) •  Assume  that 
n 

D  satisfies  the  assumptions 

of 

Problem 

1.28.24. 

Prove  that  PD(B) 

-  epD<A)(f. 

(1.28.28) 

Let  A  £  M  (C) 
n 

and  assume  that  the 

minimal  polynomial  t|i(x)  of 

A 

is  given  by 

(1.18.20). 

Let 

cn  -  a,*--- «ot 

where  each  Up  is  an  invariant  subspace  of  A,  i.e.  ADp  £  Up,  such  that  the  minimal 
polynomial  of  the  restriction  of  A  to  0p  is  (x-X^)  **.  Prove  that 

(1.28.29)  Dp  -  Zp0c"  . 

(It  is  enough  to  consider  the  case  when  A  is  in  Jordan  canonical  form.) 

(1.28.30)  Let  0^,  i-1,...,k  satisfy  the  assumptions  of  Problem  (1.28.4).  Assume  that 

D  D  »  for  i  ¥  j,  i , j  -  1,...,k.  Prove  that  P  (A)c"  n  P  (A)c"  ■  [0)  for 
l  j  Pi  uj 

k 

i  y  j.  Assume  furthermore  that  D  n  o(A)  f  4,  i»1,...,k,  c(A)c  U  D  .  Let 

1  i-1 


PD  (AJC0  »  [y!1*  ,....y(i'l ,  i»1,...,k,  x  -  (y* 1  * , . . .  ,y  * 1  * , . . .  ,y  )  e  M  (C) .  Show  that 
i  i  n 

-i  k 

(1.28.31)  X  AX  =  l  «  Bi,  C(B1)  -  Di  n  0(A),  i  -  1,...,k  . 


(1.28.32)  Let  A  E  M  (C)  and  X  c  o(A).  Prove  that 


As  in  Example 


1.29  A  canonical  form  over 

.2 

Consider  the  space  Mn(C)  .  Clearly  M^C)  can  be  identified  with  Cr  . 

1.1.13  denote  by  the  set  of  analytic  functions  f(B)  where  B  ranges  over  the 

neighborhood  D(A,p)  of  the  form  (1.22.10)  (p  *  p(f)  >  0).  Thus  the  B  -  is  an 

element  in  M_(H.).  Let  C  e  M  (H„>  and  assume  that  C  “  C(B)  is  similar  to  B  over 
n  A  n  A 

Ha*  Then 

(1.29.1)  C(B)  -  X_1(B)BX(B>  , 

where  X(B)  e  M  (H  )  and  |X(A)|  y  0.  Our  problem  is  to  find  a  "simple"  form  for  C(B) 
n  A 

(simpler  than  B  itself!).  Let  denote  the  quotient  field  of  Hft  -  i.e.,  the  set  of 

meromorphic  functions  in  the  neighborhood  of  A.  Thus  if  we  let  X  e  (M ^ )  then  we  may 

take  C(B)  to  be  R(B)  -  the  rational  canonical  form  of  B  (1.16.6).  According  to 

Theorem  1.16.11  R(B)  £  M  (HJ.  However  B  and  R(B)  are  not  similar  over  H,  in 

n  A 

general  and  we  shall  give  the  necessary  and  sufficient  conditions  for  B  *  R(B).  Thus, 
if  C(B)  *  (c± j(B) ) ,  we  may  ask  how  many  independent  variables  are  amont  the  functions 
c^tB),  i,j  »  1,...,n.  For  X(B)  «  I  the  number  of  independent  variables  in  C(B)  •  B  is 
obviously  n2.  Therefore  it  is  reasonable  to  define  C(B)  to  be  simpler  then  B  if 
C(B)  contains  less  independent  variables  than  B.  Given  C(B)  we  can  view  C(B)  as  a 
map 

(1.29.2)  C(B)  :  D(A,p)  +  H  (C)  , 

n 

where  D(A,p)  is  given  by  (1.22.10),  for  some  p  >  0.  It  is  a  well  known  result  (e.g. 
(Gunning-Rossi  [1965])  that  the  number  of  independent  variables  in  C(B)  is  equal  to  the 


rank  of  the  Jacobian  of  JC(B)  of  C(B) 


'  -»**  A 


Clearly,  AP^j  ”  pijA  is  1°  range  (A)  where 

(1.29.9)  A  =*  (I  C  A  -xA1  Cl)  :  M  (C)  *►  M  (C)  . 

n  n 


According  to  Definition  1.22.1 

dim  range  (A)  =  r(A,A) 


Let 


(1.29.10) 


M  (C)  “  range  (A)  e  (T 


u<  A,  A) 


]  • 


As  Eij<  is  basis  in  Mn(C) 


r 

p 


i.i= 


a*p*E  p 

ij  ij' 


1,...,v(A,A)  . 


Therefore 


T 

P 


n 

r 

L 

i.j-1 


(p)  3c(  A) 


i  j  3b 


ij 


According  to  (1.29.10)  are  linearly  independent.  That  is  the  inequality 

(1.29.6)  holds. 


Clearly  the  rank  of  the  Jacobian  JC(B)  is  at  least  the  rank  of  JC(A)  so  according 
to  Lemma  (1.29.6)  C(B)  has  to  contain  at  least  v(A,A)  independent  variables.  In  fact 
this  number  can  be  achieved. 

Theorem  1.29.11.  Let  ASM  (C)  and  assume  that  r_,...,r  be  any  v(A,A)  matrices 

-  -  n  "■  i  v(  a,  a  )  ■  - 

satisfying  (1.29.10).  Then  for  any  non-singular  matrix  P  t  M^tC)  it  is  possible  to  find 
X(B)  e  M  (H),  |X (0 ) |  y  0,  such  that 


(1.29.12) 


-i  -i  v(V>  -i 

X  (B)BX(B)  =  P  AP  +  )  f.(B)Pr.P  , 

i=1  1  1 


-.(B)  E  H  ,  f.(A)  =  0,  i  =  1,...,v(A,A) 
l  A  1 


Proof .  Let  R,,...,RrjA  be  a  basis  in  range  (A).  So  there  exist  such  that 

AT j  -  T j A  =  ,  j  =  1,...,r(A,A) 


Let  us  assume  that  X(B)  is  of  the  form 


r(A,A! 


(1.29.13) 


X(B)P_1  ~  I  +  1  g .  (BIT. .  g . ( A)  =  0,  j  =  1,...,r(A,A) 


j=1 


Thus  the  theorem  will  follow  if  we  can  prove  that  the  system 


r(  A,  A, ) 


r(  A,  A) 

+  l  9,(B)T  ]  =  [1+  l  g  ( B)T  ]  [A  +  \  f  <B)I\! 

j  =  1  J  J  j-1  3  3  i=1  1  1 


r(  A,  A) 


(1.29.14) 


B[I 


is  solvable  in  some  disc  D(A,p).  This  is  a  system  of  n2  scalar  equalities  in  n2 
unknowns  fl)'  9r,",9r(A  A)'  Since  fi<R)  “  9j(R>  “0.  i  *  1 , . . . ,  V  ( A,  A ) , 

j  -  1 , • • • , v( A, A)  the  above  system  is  trivially  satisfied  for  B  -  A.  According  to  the 
implicit  function  theorem  the  above  system  is  uniquely  solvable  in  some  neighborhood  of 

2 

A  if  the  Jacobian  of  these  n  equation  is  non-sinoular  (e.g.  Gunning-Rossi  [1965]).  Let 

B  =  A  +  F.  Since  all  f  ^ ,  g^  are  analytic  in  we  can  expand  these  function  in  power 

series  in  the  entries  of  F  »  (f.  .).  Let  a.  (F)  and  B.(F)  be  the  linear  terms  in  the 

tl  l  g 

expansions  of  fj  and  g^  respectively.  Then  the  Jacobian  of  the  system  (1.29.14)  B 
non-3ingular  if  and  only  if  the  first  terms  a ^  and  8_j  are  uniquely  determined  by  F. 

The  linear  part  of  (1.29.14)  reduces  to 


r(A,A’  r(A,A)  v ( A, A) 

F  +  I  VTi =  1  W +  S  Vi 

j=1  -1  J  j-1  3  J  i=1 


That  is 
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r(  A, A) 

F  ’  I 


SR  + 

j-1  3  3  i-1 


V(A,A) 

£  *,ri  • 


In  view  of  (1.29.10)  a... ...a  ,.  and  3.,. ..,8  ,.  ..  are  determined  uniquely.  So 

»  V ( A, A)  i  r( A#A) 

(1.29.14)  ie  solvable  in  D(A,p)  for  some  positive  p. 

This  establishes  the  theorem. 


Note  that  if  A  »  al  the  form  (1.29.12)  is  not  simpler  than  B.  Also  by  mapping 
T  ♦  P  'tp  we  get 

^  -1  -1 

(1.29.15)  M  (C)  -  range  (P  AP)  «  tP  r,P,...,P  V  ,.P)  . 

n  1  v( A.  A) 

Next  we  consider  the  rational  form  R(B)  of  B. 

Lemma  1.29.16.  Let  B  c  ( H R ) •  The  rational  canonical  form  of  B  over  MA  is  a 

companion  matrix  C(p)  c  m^(h^) ,  where  p(x)  »  |xi-B|. 

Proof.  The  rational  canonical  form  of  B  is  C(p1,....pk)  given  by  (1.16.6).  We  claim 

that  k  «  1.  Otherwise  p(x)  and  p’(x)  have  a  common  factor  over  Mft  which  in  view  of 

Theorem  1.4.13  implies  that  p(x)  and  p' (x)  have  a  common  factor  over  Hft.  That  is  for 

any  B  e  D(A,p)  |xI-B|  has  at  least  one  multiple  eigenvalue.  Evidently,  this  cannot  be 

true.  Indeed  consider  C  ■  P  ^BP  where  P  6  M  (C)  and  J  ■  P-^AP  is  the  Jordan 

n 

canonical  form  of  A.  So  C  e  D(J,p*).  Choose  C  to  be  an  upper  diagonal  (this  is 
possible  since  J  is  an  upper  diagonal  matrix).  So  the  eigenvalues  of  C  are 

c11,...,cnn,  and  we  can  choose  them  to  be  distinct.  Thus  p(x)  and  p'(x)  are  co-prime 

over  and  therefore  k  ■  1.  So  p^x)  ■  I  xl  — C  ( )  I  “  |xI-B|. 

Theorem  1.29.17.  Let  A  c  M  (C) .  Then  B  e  M  (H  )  is  similar  to  the  companion  matrix 
-  -  n  -  n  A  - *  1  1  — 

C(p),  p(x)  *  |xI-B|,  over  If  and  only  if  v(A,A)  "  n,  that  is  the  minimal  and  the 

characteristic  »>ol^nomials_of  A  coincide ,  i . e •  A  is  nonderogatory . 


-127- 


Assume  first  t^at  C(«)  in  can  be  chosen  to  be  r(n).  ^en  *or  *  *  A 


Lenina  1,29.18.  Let  A  CM  (C),  1  =  1,2  and  assume 

- -  -  1  n.  - 

1 

v(A  ,A  ) 


Mn  (C)  =  range  (A^  e  l  tr  i  =  1,2. 

i  1  j=1 


Suppose  that  A1  and  Aj  do  not  have  a  common  eigenvalue.  Then 


/\  v(VV  v(VV 

M  (C)  -  range  (A  e  A  )  9  l  C  [T*11  «  0)  C  £  •  [0  •  I-*21] 

"l  2  j-1  3  j-1  j 


Theorem  1.29.19.  Let  A  e  M  (C).  Assume  that  o^(A)  consists  of  i  distinct  eigenvalues 
where  nA  is  the  multiplicity  of  X^ ,  i  ■  1,...,I.  Then  B  is  similar 
over  Hft  to  the  following  matrix 


(1.29.20) 


"i 

C(B)  »  l  9  C  (B) ,  C  (B)  t  M  (H  „),  tX^Kn.)  -  C.(A)]  -  0  , 

i-1  1  1  "i.  A  i  i  i 

i  —  » 


□ 


Moreover  C(  A) 


Proof. 


®  r 


Choose 

(i) 


is  the  Jordan  canonical  form  of  A. 

P  in  the  equality  (1,29.12)  such  that  P-1AP 
Then  the  equality  (1.29.12)  yields  the  theorem. 


is  the  Jordan  canonical  form 


Problems 

k  k 

(1.29.21)  Let  A  -  J  •  H(n.  ),  n  -  J  n  .  Partition  any  B  e  M  (C)  conformally  with 
i-1  1  1 

A,  B  =  (B.  .)  ,  B.  ,  e  M  (C)  ,  i,  j  -  1 
ij  ^ 

1.23.3  prove  that  the  matrices 


k.  Using  the  results  of  Theorems  1.21.9  and 


n  v 


1.30.  Analytic ,  pointwi.se  and  rational  similar it 


Definition  1.30. 1.  Let  SI  c  c  and  A.B  c  M  (H(ft)).  Then 
■  ■  .  1 " "  n  1  1 

(i)  A  and  B  are  called  analytically  similar  (A*  B)  if  A  and  B  are  similar 

over  H(fl), 

(ii)  A  and  B  are  called  polntwiae  similar  if  A(x)  and  B(x)  are  similar  over 

C  for  all  x  e  8_ .  where  (2.  is  some  ODen  domain  S2_  ^0, 

(iii)  A  and  B  are  called  rationally  similar  ( A  »  B)  if  A  and  B  are  similar 


over  the  field  of  meromorphlc  functions  M(fl). 

Theorem  1.30.2.  Let  SI  c  c1"  and  assume  that  a,b  e  ( H ( fl ) ) .  Then  the  following 
applications  hold.  AJB»A»B»A*B. 

Proof.  Suppose  that 


(1.30.3) 


B(x)  -  P  (x)Atx)P(x)  , 


where  P.P  E  M  f H (fl ) ) .  Let  x.  C  (1.  Then  (1.30.3)  is  holding  in  some  neighborhood  of 
n  o 

Xq  .  So  A  *  B.  Assume  now  that  A*  B.  Let  C(p, , . . .  ,pk)  and  C(q1,...,qe)  be  the 

rational  canonical  forms  of  A  and  8  respectively  over  the  field  of  meromorphlc 

functions  M  (fl) .  C(p1,...,pk)  “  S(x)”1A(x)S(x) ,  C(q1.....qe>  -  T(x)_1B(x)T(x)  , 

S(x),T(x)  c  (H (SI) ) ,  | S ( x) |  ?  0,  |T(x)|  %  0.  According  to  Theorem  1.16.11  C(p1>...,p 

C(q  ,...,q  )  e  M  (H  (fl ) ) .  Let  !2  e  (1  be  an  open  connected  domain  such  that 
1  *  n  0 

A.B.S.T  E  M  (H ( fl ) )  and  A(x)  and  B(x)  are  similar  over  C  for  any  x  e  Q..  Let 
n  o 

xQ  e  !2q  such  that  |S (Xg  )T( Xg)  |  f  0.  Then  for  all  x  e  D(xq,p) 


C(pir...,p  )(x)  -  C(q1 , . . . .q^) ( x) 


Now  the  analyticlty  of  C(p1,...,pk>  and  C(q1 ... . .q^)  imply  that  these  matrices  are 


identical  in  H(!l).  That  is  A  *  B. 


Assume  that  A  ?  B.  Then  according  to  Lemma  1.22.8  the  three  matrices 


(1.30.4)  I  e  A(x>  -  At(x>  0  I,  I  0  At x)  -  B t(x)  0  I,  I  0  B(x)  -  Bt(x)  0  I 

are  equivalent  ovor  H(52).  Theorem  1.22.7  vields 

Theorem  1.30.5.  Let  A(B  £  ( H <  12 ) ) .  Assume  that  the  three  matrices  in  (1.30.4)  are 

equivalent  over  H(52).  Then  A  *  B. 

In  case  that  52  c  c,  H(!2)  is  BOO  (see  Section  1.5).  So  we  can  determine  when  these 
three  matrices  are  equivalent. 

The  problem  of  finding  a  canonical  form  of  A  £  M  (H(S2))  under  the  analytic 

n 

similarity  is  a  very  hard  problem  which  will  be  discussed  in  the  next  few  sections  for  the 
ring  of  local  analytic  functions  in  one  variable.  In  what  follows  we  determine  when  A  is 
analytically  similar  to  its  rational  canonical  form  over  -the  ring  of  local  analytic 
functions  in  t ‘ie  neighborhood  of  C  £  C™. 

For  A,B  £  M  (h(52|)  denote  by  r(A,B)  and  v(A,B)  the  rank  and  the  nullity  of  the 
n 

matrix  C»iaA-BtBI  over  the  field  By  r(A(x),B(x)>  and  v(A(x),B(x'/ 

denote  the  rank  and  the  nullity  of  C(x)  over  C.  As  the  rank  of  C(x)  is  determined  by 
the  largest  site  of  a  non-vanishing  minor,  we  clearly  have 


(1.30.6)  r(A(;),B(;))  <  r( A( x) ,B( x) )  <  r(A,B>;  v(A,B)  <  v(A(x),B(x>)  <  V( A( { ) ,B( C ) ) , 

x  E  D(C,P) 

I 

for  some  positive  p.  Moreover  for  any  p  >  0  there  exist  at  least  one  xfl  e  D(C,p)  such 


(1 . 30.7) 


r(  A(xQ  )  ,B(x0  ) )  =  r(A,R),  v(A(x0),B(x0>)  -  v  ( A,B)  . 
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Theorem  1.30.9.  Let  a  e  m  (h_).  Assume  that  <'(t>1 , . . .  ,n.)  j  s  the  rational  canonical 
form  of  A  over  M_  and  C( a. . . . .  .a. 1  is  the  rational  canonical  form  of  A(t)  over 
C.  That  is  p^  =  Pj(X,x)  and  c^  =  o_.(X)  are  normalized  polynomials  in  X  helonolno  to 
H  t X J  and  C [ X ]  respectively  lor  i  -  1,...,v,  -i  =  mtan  ( i  1  p  >  v  an* 


(ii)  n  o^_^(X)  I  IT  p^_j(X(C)  for  m  =  n,  i , . , .  ,k-1.  Moreover  p  m  y  and 
i*0  ~  i=n 

p^(X,5)  =  o^(?)  for  i  =  1, ...,*  if  and  only  if 


(1.30.9) 


r(A(O.A(;)>  =  r(A,A),  V(A(;),A(C1)  =  V( A , A)  , 


which  is  eouivalent  to  the  condition 


(1.30.10)  r(A(0,A(0)  =  r(A(y),A(x)),  v(  A(  C  > ,  A(  C  ) )  -  v  ( A(  x) ,  A(  x) ) ,  x  e  P(C,p) 


for  some  positive  p . 


Proof .  Let 


u(X,x)  »  IT  p  (X,x),  v(X)  =  II  o„(X),  i  =  1 . k,  i  •  1,  ...,£, 

n-k+i  o-1  a  n-4+1  R=1 

u  (X,x)  «  v_(X)  «  1,  for  a  <  n-k ,  B  <  n-t 
a  B 


So  Uj(X,x)  and  v^(X)  are  the  a.c.d.  of  all  minors  of  order  i  of  the  matrices  XT  -  A 

and  XI  -  A( ; )  over  the  rinos  M  [X]  and  CTX1  respectively.  (Pee  Pection  1.19.)  As 

UjfX.x)  E  H^TXl  it  is  clear  that  u^TX,?)  divides  all  minors  of  Xt  -  Afc)  of  order 

i.  Fo  u. (X ,5 ) I v. (X ) ,  i  =  1,...,n.  Since  v  .(X)  -  1  we  must  have  that  u  ,(X,xl  “ 
i  i  n— *  n** 

1.  That  is  k  <  i.  Also 


ii  (X  ,x)  =  I X  T  -  A(x)l,  v  (X)  =  I  XT  -A(CH 
n  n 


-133- 


v^(X)  .  u^fX,?) 

Whence  u  (X.c)  =  v  (X).  Therefore  — tt  ~7X — IV •  This  establishes  claim  (ii)  of  the 

n  n  v  HI  |  u(X,tl 

theorem.  Clearly  if  C( q^ , . . . ,q^ )  =  C(  p^ , . . . ,p^ ) (? )  then  k  =  £  and  p^(X,0  =  ( X  )  , 

i  =  1,...,£.  Assume  now  that  (1.30.9)  holds.  According  to  (1.21.15) 

k  £ 

v( A, A)  =  l  (2i-1)deg  p  .^(X.x),  U(A(?),A(C))  -  I  (2j-1)degq  (X)  . 

i=1  k'1  1  j=1  *"3  1 

Note  that  the  degrees  of  the  invariant  polynomials  of  (XI  -  A)  and  (XI  -  A (£))  are 
satisfying  the  assumptions  of  Problem  1.30.13.  From  the  results  of  Problem  1.30.13  it 
follows  that  the  second  equality  in  (1.30.9)  holds  if  and  only  if  k  =  £  and 
deg  p  (\,x)  =  deg  q^(X),  i  =  1,...,k.  Since  p.(X,x)  and  qi(x)  are  normalized 
polynomials  in  X  the  conditions  (ii)  yield  p^fX,?)  =  q^ ( X ) .  i  =  1,...,k.  Finally 
( 1 .30.6 )-( 1 .30.7)  imply  the  equivalence  of  the  conditions  of  (1.30.9)  and  (1.30.10). 

D 

Corollary  1.30.11.  Let  A  e  (H^) .  Assume  that  (1.30.10)  holds.  Then  A  *  B  if  and 
only  if  A  p  B. 

Proof.  According  to  Theorem  1.30.2  it  is  enouah  to  show  that  A  ^  B  implies  A  *  B. 
-  p  a 

Since  A  satisfies  (1.30.10)  then  the  assumption  that  A  “  B  implies  that  B  also 
satisfies  (1.30.10).  According  to  Theorem  1.30.8  A  and  B  are  analytically  similar  to 
their  rational  canonical  form.  From  Theorem  1.30.2  it  follows  that  A  and  B  have  the 
same  rational  canonical  form. 


Problems 


Show  that  A(x)  and  B(x)  are  rationally  similar  over  the  ring  of  rational  functions  to 
H(2).  Prove  that  A  j*  H(2),  B  £  H(2).  Show  that  A  w  B  over  Ctx].  Prove  that  the 
matrices  given  in  (1.30.4)  are  not  equivalent  over  C[x]  .  That  is  A^  B(x). 

(1.30.13)  Let  n  be  a  positive  integer  and  assume  that  {m^}"  and  {ii)n  are  two 
sequences  of  non-negative  integers  satisfying  0  4  mn  <  m  ^ 


k  k  n  n 

0  <  4  <  4  <...<4,,  T  4  <  y  m  ,  k  =  1,...,n-1,  T  4  »  £  m  «  n.  Prove  (by 

n  n_1  1  i-1  1  i-1  1  i-1  1  1-1  1 

induction)  that 


n  n 

l  (2i-1)m  <  I  ( 2i-1 ) 4 
i-1  i-1 


and  the  equality  holds  if  and  only  if  m  -  4^,  i  -  1,...,n. 

(1.30.14)  Let  C  E  C,  n  »  1,2,...,  .  Assume  that  lim  C  -  C.  Suppose  that 
n  „  „  n 

flee,  such  that  c  e  C,  n  -  1,2,...,  ,  £  e  fl.  Using  the  fact  that  if  E  H(fl) 
“  n 

satisfies  (C  )  -  0,  n  »  1,...,  implies  5  0  prove  that  for  A,B  e  M  (fl)  the 

n  n 

assumption  A ( C  )  *  B(C  ),  n  “  1,2,...,  implies  that  A  *  B. 

n  n  * 


From  this  section  through  the  end  of  the  chapter  we  shall  restrict  ourselves  to  the 


matrices  whose  entries  are  analytic  in  domains  SI  £  C.  In  what  follows  we  give  a  global 
version  of  Theorem  1.29.19. 

Theorem  1.31.1.  Let  A(x)  e  M^dKfl)),  where  S2  is  a  connected  set  in  C.  Suppose  that 
(1.31.2)  I XI  -  A(x)|  =  ^  U,x)v'2U,x)  , 


where  £^(X,x)  and  ^_(X,x)  are  two  nontrivial  normalized  polynomials  in  H(fl)[x]  of  the 
degrees  n1  and  n2  respectively  such  that  «  (X  ,xQ )  ,<P  ( X  ,xQ ) )  =  1  for  any  x^  c  fi. 

Then  there  exists  a  unimodular  matrix  X(x)  such  that 


(1.31.3) 


X_1(x)A(x)X(x)  =  C  (x)  Q  C„(x),  X(x),X-1(x)  e  M  (H(S1)>  , 

12  n 

C.  ( x)  e  M  (H(fl)),  |  XI  -  c .  ( x)  |  =  *>  (A,x>,  i  -  1,2  . 

i  ii 


Proof.  Let  P^lx)  be  the  projection  of  A(x)  on  the  eigenvalues  of  A(x)  satisfying 

.(X,x)  *>  0.  Since  (*  (X,x  ),  Vs  (X,x  ))  =  1  clearly  P  (x)  e  M  (H(S2))  for  i  =  1,2 
i  i  u  2  o  in 

(See  Problem  1.28.24.)  Also  for  any  x^  the  rank  of  P^(x0)  is  Since  H(S1)  is 

EDO  each  P^(x)  can  be  brought  to  the  Smith  normal  form 


P.(x)  =U.(x>  diag(e'l)(x),...,e(l,(x),0,...,0)v  (x), 
ii  1  n ,  1 


U,  ,V,  .U-’.V-1  e  M  (H(Si)  )  . 

i  l  i  i  n 


As  rlPjtXp))  *  nA  for  any  xQ  e  (2  we  immediately  deduce  that  '(x)  *  1 ,  j  «  1,« 

i  ■  1,2.  Let  u!i,(x),...,ufl)(x)  be  the  columns  of  U,(x),  i  =  1,2.  Since 
7  n  1 

V_1(x)  €  M  (H (SI) )  we  get  that 


(1.31.4) 

for  any  x  e  fi.  Put 

X(x)  = 

According  to  ProMem 
(1.31.3)  follows  from 


P,  (x)Cn  *  !uf  5 '  (x) , . . .  ,u ( i '  ( x)  ] 


(u^ 1  '  (x) , . . .  ,u^  *(x),  u!"  ,(x),...,u<  Nxn  e  «  (H(f))) 

1  n1  1  n?  n 

(1.2P.30)  IX(xn) |  y  n  for  any  x^  e  0.  Bo  x  ^(x)  e  M^(w(P)>.  mow 
(1.2B.31). 

□ 
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1.32 


l 


.  First  variation  of  a  geometrically  simple  eigenvalue. 

Theorem  1.32.1.  Let  A(x)  be  a  continuous  family  of  n  x  n  complex  values  matrices  for 
lx  “  <  5,  where  the  parameter  x  is  either  real  or  complex.  Suppose  that 


(1.32.2) 


A(x)  =  Ap  «■  (x  -  x^)A1  +  |x  -  Xq|o(1) 


Assume  furthermore  that  X  is  a  geometrically  simple 

x\...,xm  and  y\...,ym  be  the  eigenvectors  of  Aq 

.  t  . 

form  a  biorthonormal  system  (y1)  x"1  “  6..,  i,j  =  1,.. 
enumerate  the  eigenvalues  of  A(x)  b£  X.(x),  i  =  1,. 


eigenvalue  of  multiplicity  m. 
and  Ap  corresponding  to  \ 

.  ,m.  Then  it  is  possible  to 
..,n,  such  that 


Let 

which 


(1.32.3) 


X^(x)  =  X  +  (x  -  +  !x  -  Xp I o( 1 ) ,  i  =  1,...,m  , 


where  U,/...,u  are  the  eigenvalues  of  the  matrix 
—  1  m  - - -  ■  1 


(1.32.4) 


.  t 

s  ■  (s.^)  c  sij  =  <y  >  a,*3,  i • j  “  if.,™ 


Proof.  By  considering  the  matrix  P-1A(x)P,  where  P  E  M  (C)  we  can  assume  that  A„  is 
-  n  P 

in  the  Jordan  canonical  form  such  that  the  first  m  diagonal  entries  of  Ap  are  \. 

From  the  proofs  of  Theorems  1.29.11  and  1.29.19  we  have  the  existence  of 


X(B)  »  I  +  2(B),  Z  £  M  (HI,  7  0)  -  0 
n  0 


such  that 


(1.32.5) 


X*1  (P)  (  A„ 


B)X  (B) 


C 

I  e  c.  (b)  ,  c^oi  =  \i(m) 


Substituting 


*  n  *»«*»  t-t 


B( x)  -  A(x)  -  AQ  -  (X'xq)A1  +  |x-xQ|o(1), 

X<x)  -  X(B<  x) )  -  I  +  tx-xQ)X1  +  Ix-x  |o(1) 


we  get 


C(x)  -  X-1(x)A(x)X(x)  -  Aq  +  (x-XjKA,  +  A0X1  -  X^)  +  Ix-Xglod)  . 


According  to  (1.32.5)  X„(x),...,X  (x)  are  the  eigenvalues  of  C,(B(x)).  As 

C  (B(x  ))  «  Xl(m)  by  considering  the  matrix  [C.)(B(x))  -  XX(m)]/(x  -  xQ )  we  deduce  that 
( X^( x)  -  X)/(x  -  Xg)  are  continuous  functions  at  x  »  xQ.  Also 


[C^BIx))  -  XI(m)]/(x  -  xQ) 


((h)  (A1  ♦ 


Vi  ‘Vo 


Kj> 


+  0(1) 


where  E1  -  y1  «  <4^,..., 
eigenvectors  of  AQ  and 

(Tli,t(A0Xi  '  xiaok3  “  ° 
choice  of  eigenvectors 

choice  of  the  eigenvectors 
amounts  to  a  new  matrix  S 
same  eigenvalues. 


4.  ),i  “  1,...,m.  Now  note  that  since  E*  and  n*  are  the 
in 

Afc  respectively  corresponding  to  X  for  i  »  1 . . 

o 

for  1  <  i,j  <  m.  This  establishes  the  result  for  a  particular 
and  It  is  left  to  note  that  any  other 

x1  ...  and  y  1 . .  *  * .  ym  which  form  a  blorthogonal  system 

1  which  is  Bimilar  to  S.  In  particular  S  and  S1  have  the 


Problems 
(1.32.6)  Let 


A(x>  - 


-1  3°- 


1.33.  Analytic  similarity  over  Hq 

Let  A,B  EM  ( H  ) .  That  is 
n  0 


(1.33.1) 


A(x)  =  £  A^x1',  I x |  <  r( A) 

k=0  * 


B(x)  -  J  B  xk,  |  x  |  <  r(B) 
k-0  * 


Definition  1.33.2.  Let  A, B  £  M  (H_),  Denote  by  n(A,B)  the  index  and  <  ( A.B)  the 
■"  _ "  n  o  -  p  — ■  ■ 

number  of  local  invariant  polynomials  of  degree  p  of  the  matrix  I  ft  A(x)  -  Bt(x)  ft  I. 

Theorem  1.33.3.  Let  A,F  E  M^(Hg).  Then  A  and  B  are  analytically  similar  over  Hp 

and  only  if  A  and  B  are  rationally  similar  and  there  exist  n(A,A)  +  1  matrices 

T„,...,T  E  M  (C),  (n  -  n(A,A))  such  that  |T„|  M  0  and 
0  n  n  1  ■  u  — — 


k 

(1.33.4)  l  <A1Tk_i  -  Tk_iBi)  -  0,  k  -  0 , 1 ,  .  .  .  ,n<  A,  A)  . 


Proof .  The  necessary  part  of  the  theorem  is  obvious.  Assume  now  that  A(x)  J  B(x)  and 
the  matrices  T^,...,T  satisfy  (1.33.4)  and  TQ  is  non-singular.  Put 

-1  r  k 

C(x)  *  T(x)B(x)T  (x),  T(x)  =  l  T  x  . 

k-0  * 


As  |Tq|  i*  0,  B(x)  *  C(x),  so  A(x)  »  C(x).  This  in  particular  means  that  r( A, A)  - 
r(A,C).  Also  (1.33.4)  is  equivalent  to  A(x)  -  C( x)  -  xn+10(1).  Hence 

(I  ft  A(x)  -  At(x)  ft  I)  -  (I  ft  A(x>  -  Ct(x)  ft  I)  -  xT1+10(1)  . 


if 
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In  view  of  Lemma  1.13.4  the  matrices  I  8  A(  x)  -  At(x)  8  I,  I  8  A(x)  -  C'ttx)  8  I  are 
equivalent  over  Hq.  In  particular  n(A,A)  =  n(A,C).  Also  1,0,... ,n  satisfy  the  sysr. 
(1.33.4)  where  Bj  -  ,  i  =  0,1,...,0.  According  to  Theorem  1.13.14  there  exists 


P(x)  £  M  (H.)  such  that 
n  0 


A(x)P(x)  -  P(x)C(x)  =  0,  P  ( 0 )  =  I 


This  shows  that  A(x)  *  C(x).  By  the  constriction  C(x)  *  P(x),  so  A(x)  *  B(x), 


Note  that  if  n(A,A)  =  0  then  the  assumptions  of  Theorem  1.33.3  are  equivalent 

A(  x)  =5  B(x).  Then  the  implication  that  A(  x)  *  B(  x)  follows  from  Corollary  1.30.11. 
p  a 

Suppose  that  the  characteristic  polynomials  of  A(x)  splits  over  H„.  That  is 


(1.33.5) 


III  -  A( x ) I  =  n  (X  -  X . (x) ) ,  X . ( x)  c  H  ,  i  =  1 . n 

i= 1  1  1 


As  Hn  is  ED  according  to  Theorem  1.1B.5  A(x)  is  similar  to  an  upper  triangular 
matrix.  Using  Theorem  1.2Q.1Q  and  Theorem  1.1ft.  5  we  get  that  A(x)  is  analytically 
similar  to 


(1.33.6) 


C( x)  =  l  9  C.(x),  C . ( x )  f  M  !H  ) ,  ( a . I ( n . )  -  C . ( 0 ) >  1  =  0  , 

.  4  l  a  n .  0  li  l 

1=1  l 

a.  =  X.  (0),  a.  ¥  ci.  for  i  ¥  ' ,  i»i  - 
ii  i  : 


and  each  C^(x)  is  an  upper  triangular  matrix.  In  this  case  w-  ar.  he  more  specific  above 
the  form  of  the  upper  triangular  matrix. 

Theorem  1.33.7.  Let  A(x)  c  M  (H„).  Assume  that  the  character i st i c  polynomial  of  A(x> 
-  — —  n  ■  -■  ■  -  ■  ■  — 

splits  in  Then  A(  x)  is  analytically  similar  to  a  block  diagonal  matrix  C(  x)  of 

the  form  (1.33.6)  such  that  each  C^(x)  is  an  upper  triangular  matrix  whose  off-diagonal 
entries  are  polynomials  in  x.  Moreover,  the  degree  of  each  polynomial  entry  above  the 


diagonal  in  the  matrix  C^(x)  do  not  exceed  i  =  1,...,C  • 


W 


Proof.  Clearly,  in  view  of  Theorem  1.29.19  we  may  assure  that  4  ■  1,  That  is  A(0) 
one  eigenvalue  a0*  Furthermore  by  considering  A(x)  -  oigl  we  may  assume  that  A(0) 
nilpotent.  Also  by  Theorem  1.18. IS  we  may  assume  that  A(x)  is  already  in  the  upper 
triangular  form.  Suppose  now  that  in  addition  to  all  the  above  assumptions  A(x)  is 
nilpotent.  Define 

X.  »  {y  |  A^y  -  0,  y  e  M  .(H)},  k  -  0,1,...,  . 

K  tl  1  0 

So 

[0]  *  Xq  ^  ^  Xj  Xp  *  Mn1(H0)  • 

According  to  Theorem  1.11.12  it  is  easy  to  show  the  existence  of  a  basis  y 1(x) , . . . ,yn( x) 
1 

in  Mn(HQ)  such  that  y  (x),...,y  (x)  is  a  basis  in  Xk.  As  A(x)X){+1  c  xfc  we  have 

iL 

vk 

A(x)yj  -  l  gijtx)yi(x),  ^  <  j  <  *k+1  . 

Define  *)  “  0  for  i  >  ^  and  <  j  «  Put 

G(x)  -  ( g .  ( x) ) T(x)  -  (y 1 ( x) , . . . ,yn(x) )  e  M  (H  ) 
i  no 

Since  y1 (x) , . . . ,yn(x)  is  a  basis  T  '(x)  e  M  (H„).  So 

n  u 

G(x)  ■  T*1  (x)A(  x)T(x)  ,  8  -  n  ( A ,  A)  -  r\(G,G) 

Put 

00  )( 

G(x)  -  l  G  xj,  G<k)  -  l  G  xj,  k  -  0,1 . 

j-0  3  j-0  3 


has 

is 
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] 


We  claim  that  G*s'(x)  *  G(x).  First  note  that 

(i  e  G(x)  -  Gt(x)  *  i)  -  {i  a  g(s)(x)  -  [g<s)(x))  e  1}  =  x(s+1)o(i)  . 

From  Lemma  1.13.4  it  follows  that  the  matrices  I  B  G(x)  -  Gfc(x)  B  I,  I  *  G*s*(x)  - 
[G^s'(x)]t  8  X  have  the  same  local  invariant  polynomials  up  to  the  degrees.  So 

r(G,G)  <  r(G<S),G(S>) 

which  is  equivalent  to  the  inequality 

(1.33.8)  V(G(S),G(S)>  <  V(G,G>  . 

Let 


\  *  (y  I  y  •  (y, -•••  »yn)t.  -  0  for  j  >  . 

Clearly  if  g^(x)  =  0  then  the  (i,j)  entry  of  G^sNx)  is  also  equal  to  zero.  By  the 
construction  g^x)  *  0  for  1  >  and  ^  <  j  <  So  G*s'(x)y)t+1  £  Y^, 

k  »  0,...,p-1.  By  Theorem  1.24.1 

(1.33.9)  v(G(x0),G(x0>)  <  v(G(S)(x0),G<S,(x0)) 


for  all  xn  in  the  neighborhood  of  the  origin.  So 

v(G,G>  <  v(G<S),G(,>). 
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This  proves  that  the  equality  sign  holds  in  (1.33.6)  which  in  return  implies  the  equality 
sign  in  (1.33.9)  for  0  4  |x^|  <  p.  By  Theorem  1.24.1  G(Xq)  *  G*s'<Xq)  for 
0  4  I x 0 1  <  p.  Using  Theorem  1.30.2  we  deduce  that  G  *  G*s*.  As 

G ( x) I  -  IG(s)(x)  -  xs+10(1) 

Theorem  1.33.3  implies  that  G(x)  *  G^(x).  This  establishes  the  theorem  in  case  that 
A( x)  is  a  nilpotent  matrix.  Next  consider  the  case  where  A(x)  is  an  upper  triangular 
matrix  whose  diagonal  entries  (the  eigenvalues  of  A(x))  are  arranged  in  the  following 
order 

A(x)  «  (A.  .(x))4,  A  ,  { X)  E  M  (H.)  , 

il  ij  nin j  D 

ni 

(1.33.10)  Aij<x)  *  0  for  1  <  "  X1(x)l(n1))  -  0  , 

X^x)  f  X^(x),  for  1  f  j,  i,j  -  . 

We  already  showed 


Ali(x)  -  T^’(x)Fii(x)Ti(x),  T1,t‘1  e  Mn(H0)  , 

and  each  (x>  “  D  (xlKn^)  is  a  nilpotent  upper  triangular  matrix  with  polynomial 

entries  of  the  form  described  above.  Put 

*  '  » 

G( x)  *  (G  < x) )  -  T  (x)A(x)T(x) ,  T(x)  »  \  ®  T  (x) 

‘J  1  1-1  1 

As  X ^ ( x )  >  ^fx)  for  i  f  j  Problem  1.21.21  implies 

i 

v(G.G)  -  l  V<GU<G11>  • 
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Let  G*  *(x)  be  defined  as  above.  According  to  Theorem  1.23.6 


WG(k>.G(k,>  >  I  (G{k).G{ik), 
i=1 


Using  Theorem  1.24.1  as  before  we  get 


V(G. . ,G. . )  <  v(G(k) ,G(k) )  . 

11  11  11  11 


Combine  the  above  inequalities  to  deduce  that 


\><G,G)  <  v<G(S\G<S)) 


Compare  the  above  inequality  with  (1.33.8)  to  obtain  the  equality  sign  in  (1.33.8).  So 


(1.33.11) 


"«>  =  V(Gii'Gii>’  1  =  1 . 1 


K 

D.  (x)  «  X.(x)I(n.)  =  VL  D..x^,  D(k,(x)  =  £  D  .  x^ ,  i  =  1,...,4  , 

i  i  i  13  i  iD 


(1.33.12) 


=  l  9  D.(x),  D(k,(x)  »  J  9  D^k 


,_(s)  _ ( S )  (s)  n(S)  _  . 

V(G. .  “  D,  >  G  *  D.  )  =  v(G. .  •  H  ,  G  -  D  ) 

li  i  ii  i  li  i  ii  i 


As  before  using  Theorem  1.24.1  we  deduce  that  g!*^  -  d!S*  *  G.,  -  D. ,  i.e., 

ii  i  r  ii  1 

g|8'  -  d{8)  +  ^  Since  { x)  ?  X_j(x)  for  i  j  we  finally  deduce  that 

G  *  G*8'  -  0<8)  +  D.  Also  GI  -  KG*8'  -  D*8'  *  D)  »  x8+10(1).  Therefore  according  to 
Theorem  1.33.3  G  »  G*8'  -  D*8'  +  D.  The  proof  of  the  theorem  is  completed. 


Theorem  1.33.13.  Let  P(x)  and  Q(x)  be  matrices  of  the  form  (1.33.6) 


n 


E  mi 

P(x)  -  )  •  P.(x),  P.(x)  t  M  (Ha),  (a. I(m.)  -  P . (0 ) )  *  -  0  , 

1  0  11  1 


ai  M  for  i  ¥  j.  i.j  “  1 . . 


(1.33.14) 


?  "j 

Q(x)  -  l  9  Q^x),  Oj  e  Mn  (H0),  (SjKrij)  -  0^(0 > )  J  «  0  , 


8i  *  Bj  for  i  *  j,  i.j  -  1, 


Assume  furthermore  that 


(1.33.15)  ai  »  Bi(  i  -  1 . t,  al  *  t  •  t+1,...,p,j  -  t+1 . . 

0  <  t  <  min(p.g)  . 

Then  the  non-constant  local  invariant  polynomials  of  I  »P(x)  -  Qt(x)  ®l  consist  of  the 
non-constant  local  invariant  polynomials  of  I  ®  Pjlx)  -  Q*(x)  •  I,  i  «  1,...,t.  That  is 

t 

(1.33.16)  K  (P.Q)  -  [  <  (P. ,C. ).  P“  . . 

P  i-1  P  i  1 

In  particular  if  C(x)  is  of  the  form  (1.33.6)  then 


(1.33.17) 


n(c,c) 


max  n(C  ,C  ) 
1<i<£  1 


Proof .  According  to  Theorem  1.13.14 


Kp(P,C> 


dim  Wp_, 


dim  Wp  , 


where  wp  £  Mn(C)  is  t*le  subspace  of  n  *  n  matrices  XQ  such  that 


(1.33.18) 


I  (P,  .X.  -  X.O  . )  =  0,  k  =*  0,..., 

j=o  3  ’ 


Here 


-  I  B 

j=0 


.x’,  P  (x)  -  l  P(ilx^,  P  -  f  •  »(1!  , 

3  1  3=1  3  j  i=1  j 


C(x)  =  l  0.xj,  0.{x)  =  l  £ ?'l)x3,  0.  =  1  »  o'1’ 


j**0 

.<3),  v(j) 


j-0 


'j  -  i  ' 

J  i=1  J 


Partition  X.  to  (X  a  ),  X  .  e  M  (C) ,  a  =  1,...,p,  8  =  1,...,q.  We  claim  that 
J  <*p  ap  m  n„ 

( i)  a  ® 

X  o  =0  for  if  either  a  >  t+1  or  8  >  t+1  or  a  4  B.  This  statement  follows  easily 
an 

by  induction  since  in  view  of  Lemma  1.21.5  the  equation 


p'a>  V  -  vo'6’  -  0 
0  0 


has  only  the  trivial  solution  for  those  a  and  8*  Thus  (1.33*18)  splits  to  the  systems 


I  -  x!j>Q*n)  =  0,  i  =  1,...,t  . 

jio  11  11 


-  1  4  8  - 


f 


t  to  deduce 


Apply  the  characterizations  of  <  (P,C>)  and  x  (P.,Q),  i  =  1 

p  P  i  i 

(1.33.16).  Clearly  (1.33.16)  implies  (1.33.17). 

□ 

We  close  this  section  by  remarking  that  the  main  assumption  of  Theorem  1.33.7  that  the 

characteristic  polynomial  of  A(x)  splits  in  HQ  is  not  a  heavy  restriction  in  view  of 

the  Weierstrass  preparation  theorem,  Theorem  1.6.5.  That  is  the  eigenvalues  of  Aty™) 

split  in  H„.  If  we  choose  m  =  n!  then  this  statement  holds  for  all  A(x)  e  M  (H„). 

o  n  0 

According  to  Problem  1.33.19  A(x)  |  B(x)  if  and  only  if  A(ym)  B(ym).  Therefore  the 
classification  problem  of  analytic  similarity  classes  reduces  by  Theorem  1.33.7  to 
determine  the  structure  of  the  polynomial  entries  which  are  lying  above  the  main 
diagonal.  Thus  given  the  rational  canonical  form  of  A(x)  and  the  index  n(A,A)  the  set 
of  all  possible  analytic  similarity  classes  which  may  correspond  to  A  is  a  certain  finite 
dimensional  variety. 

The  case  n  «  2  is  classified  completely  (Problem  1.33.20).  In  this  case  to  a  given 
rational  canonical  form  there  are  at  most  countable  number  of  analytic  similarity  classes. 

For  n  »  3  we  already  have  examples  in  which  to  a  given  rational  canonical  form  there 
may  correspond  a  finite  dimensional  variety  of  distinct  similarity  classes  (Problem 
1.33.21  ). 

Problems 

(1.33.19)  Let  A(x),B(x)  c  M  (H„).  Let  m  be  a  positive  integer.  Assume  that  A(y™)T(y) 

n  o 

-  T(y)B(ym>,  T(y)  e  M  (HI.  Show  that 
n  o 


A( x)Q(x) 


1  m 

S>(x)B<  x)  ,  Qfy")  -  -  l  T 
"  k-1 


Q(X)  E  M  (H.)  . 

n  o 


Prove  that  A(x)  J  B(x)  if  and  onlv  if  A(ym)  J  B(ym). 

a  a 

(  1.33.20)  Let  A(x)  e  M2<H0!  an<1  assune  that 
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i 


|XI  -  A(  x)  I  -  (X— (x) )  < X— X2 ( x) ) ,  X1(x)/X2<x)  c  Hq). 


Vx>  “  l  i  »  1.2.  X(1)  -  x‘2\  j  -  0 . .  x'J*  ¥  x'2)  , 

1  j=0  J  X  3  p+1  p+1 


-1  <  p  <  *>(p  =  “  means  X^lx)  =  X^fx)) 


Prove  that  either  A(x)  is  similar  to  a  diagonal  matrix  or  A(x)  is  similar  to 


B(x) 


X^x)  x 


X2<x)  , 


,  k  =  0, 1 , . . . ,p  (p  >  -1 ) 


In  the  second  cast  n(A,A)  =  k.  (Hint:  Use  a  similarity  transformation  of  the  form 
DAD-*  where  D  is  a  diagonal  matrix.) 

(1.33.21)  Let  A  e  M3(Hq).  Assume  that  A(x)  *  C(p),  p(X,x)  »  X(X-x2m) (X-x4m) ,  m  >  1. 
Prove  that  A(x)  is  analytically  similar  to  a  matrix 


4 

0 

V 


0  <  k,.  k2  <  ®  (x“  =  0) 


where  a(x)  is  a  polynomial  such  that  deg  a  <  4m.  (Use  the  previous  problem.)  Assume 
that  k^  =  k2  “  m.  Show  that  B(x,a)  *  B(x,b)  if  and  only  if 


(i) 

b-a 

is  divided  by 

xm 

in  case  that  a ( 0 )  ¥  1. 

(ii) 

b-a 

is  divided  by 

xm+k 

in  case 

that  a ( 0 )  =  1  and  a(i,(0)  =  0,  i  =  1,...,k-1 

¥  0 

for  1  <  k  < 

m. 

tiii) 

b-a 

is  divided  by 

x2ra 

if  a ( 0  ) 

-  1,  a(i)(0),  i  -  1 , . . . ,m-1 . 

That  is 

for 

k)  =  kj  =  m 

the  set  of  all 

analytic  similarity  classes  of  matrices  B(x,a) 

can  be  regarded  as  a  union  of  m  +  1  copies  of  (f  -  {0},  which  are  m  nonfixed 


coefficients  of  b(x) 


(1.33.22)  Let  P  and  0  satisfy  the  assumptions  of  Theorem  1.33.13.  Show  that  P  and 


Q  are  analytically  similar  if  and  only  if  p  *  q  -  t,  m^  ■  n^,  i  -  1,...,t  and 
P^(x)  a  for  i  “  1.....t. 


1.34.  Similarity  to  diagonal  matrices 


Theorem  1.34.1.  Let  A(x)  e  M^tHp)  and  assume  that  the  characteristic  polynomial  splits 
in  Hp  as  given  in  (1.33.5).  Let 

(1.34.2)  B(x)  =  diaq( X  ( x) .....  X  (x))  . 

1  n 

Then  A(x)  and  B(x)  are  not  analytically  similar  if  and  only  if  there  exists  a  non- 
negative  integer  p  such  that 


(1.34.3) 


<  ( A, A)  +  <  (B,B)  <  2k  (A,B)  , 

P  P  P 

<_.  (  A,  A)  +  K.(B,B)  =  2<_.(A,B),  j  =  0,...,p-1,  if  p  >  1  . 


In  particular  A(x)  and  B(x)  are  analytically  similar  if  and  only  if  the  three  matrices 
given  (1.22.6)  are  equivalent  over  Hp. 

Proof ■  Suppose  first  that  (1.34.3)  holds.  Then  the  three  matrices  in  (1.22.6)  are  not 
equivalent.  Hence  A(x)  |  B(x).  Assume  now  that  A(x)  |  B(x>.  Without  a  restriction  in 
the  generality  we  may  assume  that  A(x)  =  C(x)  where  C(x)  is  given  in  (1.33.6).  Let 


B  (  x ) 


£ 

®  B.(x),  B.(0)  =  a.Itn.),  j  “  1, ...,£,  m. 

3  3  3  3  3 


n  +. ■ • +n  .  (n  «D ) 
0  j  0 


We  prove  (1.34.3)  by  induction  on  the  dimension  n.  For  n  =  1  the  theorem  is  obvious. 
Assume  that  the  theoren  holds  for  n  <  jj-1,  Let  n  -  N.  If  A(0)  ¥  B(  0 )  then  Theorem 
1.22.3  implies  the  inequality  (1.34.3)  for  p  »  0  and  the  theorem  is  proved.  Suppose  now 

that  A ( 0 )  *  B ( 0 ) .  That  is  A^(0)  =  B  (0)  =  a^I(n^),  j  -  1 . 1.  Suppose  that  i  >  1. 

By  Theorem  1.33.13 
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1.3S.  Strict  similarity  to  diagonal  matrices. 

Let  A(x)  e  M  (H  )  •  According  to  the  Weierstrass  preparation  theorem  {Theorem  l.s.sj 
n  u 

the  eigenvalues  of  Aly1")  are  analytic  in  y  for  some  1  <  m  <  n.  That  is  the 
eigenvalues  X„(x),...,X  (x)  of  A(x)  are  multivalued  analytic  functions  in  x  which 

i  n 

have  the  expansion 


eo 

i  .  r  .  k/m 

X1<x)  =  L  -  3  »  1,...,n  . 

3  k=0  3* 

In  particular  each  X  (x)  has  at  most  m  branches*  For  more  properties  of  the 

eigenvalues  \.(x),  j  =  see  Kato,  [1976,  Chapter  2],  Let  A(x)  e  M  (C[x])«  So 

J  n 

m 

(1.35.1)  A(x)  =  l  A  x  ,  Ak  e  Mn<C),  k  =  0,1,... ,m  . 

k=0 

The  eigenvalues  of  At  x)  satisfy  the  equation 


(1.35.2)|AI  -  A(x)|  =  An  +■  j  a.(x>An  ^  =  0,  a.tx )  e  C[x] ,  j  «  1,...,n  , 

j-0  3  3 


Thus  the  eigenvalues  A  (x),...,A  (x)  are  algebraic  functions  of  x  (see  for  example 
I  n 

Gunning-Rossi  (1965J).  The  equation  (1.35.2)  describes  one  or  several  compact  Riemann 
manifolds  according  to  the  polynomial  (1.35.2)  is  irreducible  or  reducible  in  C[x,X). 
According  to  the  Weierstrass  preparation  theorem  X^(x)  has  the  followina  expansion  aroun 
x  -  0 


(1.35.3)  X  . ( x)  -  v  X  (C)(x-C)k/m,  j  =  1 , . . . ,n  . 

3  k-0  3 

The  number  m  can  be  chosen  to  be  independent  of  ?.  For  example  m  =  n!  will  always  he 
correct.  These  expansions  are  called  the  Pulseaux  expansion  of  X_.(x).  since  A(x1  is  a 
polynomial  matrix  each  X_.(x)  can  be  expanded  in  the  neighborhood  of  <*>.  To  do  so  we  not 


that 


A(x)  -  xmB(^),  B(y)  -  l  A^y™-1'  • 

k«0 

Expand  now  the  eigenvalues  of  B(y)  at  y  ■  0  to  get 

00 

(1.35.4)  X  (x)  «  x"  [  (~)x  k/,m'  1  "  1"**'n  • 

J  k=0  3 

Definition  1.35.5.  Let  A(x),B(x)  £  kMCtx]).  Then  A(x)  and  B(x)  are  called  strictly 

similar  (A  *  B)  if  there  exists  P  e  Mn*c' '  lpl  ?  0  such  that  B(x)  «  PA(x)P-1. 

From  Lemma  1.22.8  it  follows  that  if  A( x)  »  B(x)  then  the  three  matrices  in  (1.30.4) 
are  equivalent  over  Ctx]  .  To  take  in  account  the  point  x  =  ">  we  need  to  homogenize  as  in 
Section  1.14. 

Definition  1.35.6.  Let  A(x)  be  given  by  (1.35.1).  Denote  by  Alxg.x^  the 

corresponding  homogeneous  matrix 

m*  ,  . 

(1.35.7)  A(x  ,x  )  -  I  Aj^x”  "  x*  E  Mn(C[x0,Xl)  )  , 

k**0 

where  m’  -  0  if_  A(x)  -  0  and  A^,  ¥  0,  A^  =  0  for  m1  <  j  <  m  _if  A(x)  ¥  0. 

Clearly  if  B(x)  -  PA(x)P_1  then  Btxg.x,)  -  PA(x„,x1 )p-1.  According  to  Lemma  1.22.8 
the  matrices 

(1.35.8)  1  0  A(x0,x1)  -  At(x(),x1)  a  I,  I  B  AlXg.x,)  -  Bt(x(),x1)  a  I  , 

X  B  B( Xp ,x^ )  -  Bt(x0,x1)  a  I  . 

are  equivalent  over  C(Xg,x^].  Thus  from  Lemma  1.10.3  it  follows 

Lemma  1.35.9.  Let  A(x).B(x)  £  M  (dxl).  Assume  that  A(x)  *  B(x).  Then  the  three 
—  “■  n  3  ““ 

matrices  in  (1.35.8)  have  the  same  invariant  polynomials  over  Clxp.x^. 
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*k'*0'x1*'  k  =  the  invariant  factors  of  the  matrix  I  9  AtXQjX^)  - 

st ( Xp ,x1 )  a  i. 

As  in  the  proof  of  Lemma  1.14.8  we  have  that  i^lx^x.,)  is  a  homogeneous  polynomial 

for  k  =  1 , . . . ,r( A,B) .  Moreover  i^fl.x),  k  =  1 , . . . , r( A, B) ,  are  the  invariant  factors 

of  I  a  A(x)  -  Bfc(x)  a  I.  (See  Problems  1.14.30  -  1.14.31.) 

We  now  answer  the  problem  when  A(x)  is  strictly  similar  to  a  diagonal  matrix 

B(x)  e  M  (Ctx))  of  the  form  (1.34.2). 
n 

Theorem  1.35.11.  Let  A(x)  e  M  (Ctxl).  Assume  that  the  characteristic  polynomial  of 
A( x )  splits  to  linear  factors  over  C(x].  Then  A(x)  is  strictly  similar  to  the  diagonal 
matrix  given  by  (1.34.2)  if  and  only  if  the  three  matrices  in  (1.34.8)  have  the  same 
invariant  factors  over  C[x0,x.,]. 

Proof .  without  loss  of  generality  we  may  assume  that  B(x)  is  of  the  form 


(1.35.12)  B(x)  =  l  1  (*)!(»  ),  XA(x)  -  \  (x)  fo/  i  -  j,  i,j  =  1 . 1 

i=1  1  1  3 


Thus  for  all  C  except  a  finite  number  of  points  we  have 


(1.35.13) 


*.<?>  J1  V(c>,  i,j  = 


Let  Pj(x)  be  the  projection  of  A(x)  on  X^(x),  j  £  1,...,i.  Suppose  that  (1.35.13)  is 
satisfied  at  Xg.  According  to  Problem  1.28.24  each  P^(x)  is  analytic  in  the 
neighborhood  of  ?.  Assume  that  (1.35.13)  does  not  hold  for  C  e  C.  The  assumption  that 
the  three  matrices  in  (1.35.8)  have  the  same  invariant  polynomials  imply  that  the  matrices 


in  (1.30.4)  are  equivalent  over  H^.  Now  use  Theorem  1.34.1  to  get  that  A(x)  and  B(x) 
are  analytically  similar  over  H^.  Clearly  P^(B)  the  projection  of  B(x)  on  \^(x)  is 
equal  to  0  ©  I(n.)  ©  0.  Prom  Problem  (1.28.27)  we  deduce  that  P.(x)  is  also  analytic  in 


the  neighborhood  of  ;.  The  same  arguments  apply  for  ;  “  This  follows  by  considering 
the  matrices  A(xQ,1)  and  B(x0,1).  Thus  P^(x)  is  analytic  on  the  Riemann  sphere.  In 
particular  P^(x)  is  bounded.  The  Liouville  theorem  (e.g.  Rudln  [1974])  implies  that 
Pj  is  a  constant  matrix.  Let 


P.C 

1 


tx 


jl 


,  ,x 


M.  u 


,1,  X 


,  11  1n1 
(x  , . . . ,x  , 


il 


in 

.x  *) 


E  M  (C) 
n 


According  to  Problem  1.28.28  x_1A(x)X  ■  B(x)  for  C  which  satisfy  (1.35.13).  Finally 
the  analyticity  of  A(x)  and  B(x)  implies  the  validity  of  the  above  equality  for  all  x. 

□ 

Let  A(x)  e  M^fC).  Suppose  that  A(x)  is  strictly  similar  to  a  diagonal  matrix 
B(x).  Consider  the  corresponding  homogeous  matrix  ACxq.x^.  Then  we  obviously  have  that 
for  any  C„/51  £  C,  AfCg,^)  is  similar  to  a  diagonal  matrix,  i.e.,  A(;  ,^1  is 

dlagonable.  However  if  AtC^,^)  is  diagonable  this  does  not  imply  that  A(x)  is 
strictly  equivalent  to  some  diagonal  matrix.  For  example 

(1.35.14)  A(x)  -  +  (o  o)X+  (o  i)*2  * 

See  Problem  1.35.24.  We  now  give  a  sufficient  condition  on  A(x),  such  that  A(Cg.Cf)  is 
diagonable  for  any  which  implies  that  A(x)  is  strictly  similar  to  a  diagonal 

matrix. 

Definition  1.35.15.  Let  A(x)  E  M^tCCx])  be  of  the  form  (1.35.1)  normalised  by  the 
condition  A^,  f  0  _i£  m  *  1.  Let  X^(x)  and  X^( x )  be  two  eigenvalues  of  A(x).  The 
eigenvalues  X  (x)  and  X  (x)  are  said  to  touch  at  C  if  the  Puiseaux  series  of  X  (x) 

— a -  P  -  g  -  -  P 


and  X  (x)  at  x  «  c  satisfy  the  following  relations 


(1.35.16) 


Vc)  ’  v°'  k  ■ 0 


s 


for  a  finite  or  infinite  C . 

Theorem  1.35.17.  Let  A(x)  e  M  (C[x] )  be  of  the  form  (1.35.1)  normalized  bv  the 
-  n  . . — .  ■  -  * 

condition  A^  ¥  0  m  >  1 .  Assume  that  the  corresponding  homogeneous  matrix  A  ( xQ ,  x ) 

is  diagonable  for  any  e  c*  Suppose  furthermore  that  no  two  distinct  eigenvalues 

of  A(x)  touch  at  any  finite  or  infinite  point  of  the  Riemann  sphere.  Then  there  exists  a 

constant  matrix  X  €  M  (C) ,  I X I  ?  0  such  that 
-  n  - - 

-1  r  k 

(1.35.18)  X  A( x)X  =  l  D  x  , 

k=0 

where  DQ , . . . , Dm  are  diagonal  matrices. 

Proof .  Clearly  we  can  view  A(x)  as  a  matrix  in  Mn(W),  where  A)  is  the  field  of 
rational  functions.  Let  1C  be  a  finite  extension  of  all  such  that  I XI  -  A(x)|  splits  to 
linear  factors  over  K.  Thus  A(x)  t  distinct  eigenvalues  X^ (x) , . . . , X^(x) ,  such  that 
X^fx)  has  multiplicity  n^,  k  =  1 ,...,4.  Thus  for  all  except  a  finite  number  of  points 
(1.35.13)  holds.  Assume  that  ;  satisfies  (1.35.13).  Denote  by  P_.(;)  the  projection 
of  A(x)  on  X^(?),  j-1, According  to  Problem  1.28.24  P^(x)  is  analytic  in  the 

neighborhood  of  £.  Also  in  view  of  Problem  1.28.32  P^(C)  is  given  by  the  formula 

£ 

(1.35.19)  p. (?)  ”  n  IA(C)  —  X  ( C ) I 1 /( X . ( C >  *  X  ( C 1 1  . 

j  k.1,wj  k  j  * 

g 

We  claim  that  in  the  neighborhood  of  any  point  c,  V  Ay  +£)  is  analytic  in  y.  (In  case 
that  5  “  “  Pj(y_s)  is  analytic  in  y  in  the  neighborhood  of  the  origin.)  Clearly  it  is 
enough  to  consider  the  points  5  at  which  (1.35.13)  is  violated.  For  simplicity  of 
notation  we  consider  P^(x)  in  the  neighbornood  of  C.  Assume  first  that  ;  is  finite. 
Suppose  that 
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n  •  X1(C)-...»XU(C),  Xk(c)  j  X^c),  k  -  u+1 , . . .  ,£  . 

According  to  Theorem  1.29.19  there  exists  0<x)  c  M  (H  ) ,  IO(C)|  1  0  such  that 

n  C 

S  '(xlAlxlQfx)  -  C,(x)  •  C  (x),  C,(x)  EM  (H  )  , 

1  2  1  n>1  C 

u 

C  <  x  >  e  M  (H  ) ,  m  «  V  n  ,  m  «  n-m 

2  m2  !  i-1  1 

Moreover  the  eigenvalues  of  C,(x)  and  C2(x)  are  Xf  (x) , . .  .,X  (x)  and 

Xu+i<x),...,X4<x)  with  the  multiplicities  n1(...,nu  and  nu+1 . n^  respectively. 

Since  A(x)  is  diagonable  for  each  x,  C^tx)  and  C2(x)  are  also  diagonable  in  the 
neighborhood  of  (.  So 


i 

n 

k-u+1 


(C2(x) 


Xk(x)I)  -  0 


which  yields 


J 


t 


r 


i 

n  (A(x)  -  X  (x)I  )/[X  (x)  -  X,  (x)I  - 
k-2  k  1  k 

-1  * 

Q  (x) {  n  [C  (x)  -  X  (x)I]/[X  (x)  -  X  (x)]  «  0}Q(x)  . 

k-2  K  1  * 


The  assumption  that  n  -  X1  (?)-..  .-X^t;;)  and  the  diagonability  of  C^tx)  imply 


C,  <x)  -  r,I  +  (x-;)c3(x)  . 


Therefore  the 
j  -  1 , . . . ,u. 


Puiseaux  series  of  X^x) , . . . ,  Xy(  x)  satisfy  X^tt;)  -  0,  k  -  1,...,s-1, 
As  no  two  distinct  eigenvalues  of  A(x)  touch  at  ?  we  get  that 
u.  so 
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X,sU>  1  Xjg(c),  for  j  -  2 


[ 


f 


u 

n  ic^v3*?)  -  \ ^ < ys+c )i]/rx1<ys+c)  -  x.(ys+cn 

3=2  3 

u  *>  oo 

-  n  (Cj  (ys+c )  -  (  *  '  k(Oyk_s)i]/{  I  [X  <c>  -  x^k ’ yk-s}  . 

j=2  k=s  J  k=s  3 

s  s 

Also  <  V ^  ( y  +C  )  -  A ( v  +C  ) )  c  H r  for  j  =  u+1, ...,£.  This  shows  that  P^(yS+£)  is 

analytic  in  H.  for  any  finite  ?.  By  considering  A(x)  =  xmA(-)  we  transform  "  to 
u  x 

0  and  the  same  result  applies  to  C  =  ".  In  particular  we  have  that  P,(x)  is  bounded  on 

the  Riemann  sphere.  Put  P^x)  =  (pj^'(x)).  Let  max|p!J)(x)|  =  |p|^'(5i  )|, 

( 1 )  s 

( s i j  may  he  ").  As  p_  (C  +  y  )  is  analytic  in  y  the  maximum  principle  implies 
ID  s 

that  p  <c  +  y  )  is  constant  in  the  neighborhood  of  the  origin  (e.g.  Rudin  [1974]). 

The  analytic  continuation  principle  yields  that  p^'(x)  is  constant.  Hence  p,  =  P^x) 
is  constant  and  in  the  same  manner  we  deduce  that  all  Pj(x)  are  constant.  Define  the 
matrix  X  as  in  the  proof  of  Theorem  1.35.11  to  deduce  (1.35.18). 

D 

Corollary  1.35.20.  Let  A(x)  be  of  the  form  (1.35,1),  Assume  that  the  matrices 
A0'"*'Am  are  diagonable  and  A^  ?  A^Aj  for  some  0  <  i  <  j  <  m.  Then  either  there 
exist  ?()>?,  e  C  such  that  A(C0,e,)  is  not  diagonable  or  there  exists  a  point  C0  on 
the  Riemann  sphere  (possible  co  =  «•)  and  two  distinct  eigenvalues  of  A(x)  which  touch 

—  V 

It  can  be  shown  that  for  the  matrix  (1.35.14)  the  two  distinct  eigenvalues  of  A(x) 
touch  at  ",  (Problem  1.35.25.)  However  if  A(x)  is  a  pencil,  i.e.,  A(x)  =  Aq  +  xA, 
then  two  conditions  of  Theorem  1.35.17  are  redundant.  More  precisely  we  have 
Theorem  1.35.21.  Let  A(x)  “  Aq  +  xA,  be  a  pencil  in  Mn(C[xl).  Assume  that  for  any 
C  e  C,  A(  C )  Is  a  diagonable  matrix.  Then  the  eigenvalues  of  A(x)  are  linear  functions 
in  x 


(1.35.22) 


A  ^ ( x)  =  +  B^x,  j  =  1,...,n 


In  particular  no  two  distinct  eigenvalues  of  A(x)  Intersect  at  any  point  of  the  Riemann 
sphere . 
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>  I 


Proof.  Consider  a  multivalued  function  X.(x)  which  has  the  Puiseaux  series  (1.35.3) 
-  1 

clearly  AMx)  is  also  multivalued  functions  which  is  given  by 


k*0  9  ^ 


XU1  (?)(x-?) 


As  A (C)  is  a  diagonable  matrix  any  X^(c)  is  a  geometrically  simple.  According  to 

Theorem  1.32.1  X  «  0,  k  =■  1 , . . . ,  8-1 .  So  X*  (x)  is  bounded  in  the  neighborhood  of  C. 

j*  3 

Let  C  *  Then  the  Puiseaux  series  of  X^(x)  are  of  the  form 


OB 

,  ,  .  v  >  ,^8  (•-*)/• 

X  .(  x)  -  '  X  ..  (•)* 

i  v:«  j* 


So  X^(x)  is  also  bounded  at  the  neighborhood  of  «.  Now  use  the  arguments  of  the  last 
part  of  the  proof  of  Theorem  1.35.17  to  deduce  that  X^(x)  ■  8 y  j  •  1,...,n.  This  of 
course  implies  (1.35.22).  In  view  of  (1.35.22)  the  equalities  (1.35.16)  imply  that 
a  »  a  ,  8  »  8  ,  i.e.  X  (x)  »  X  ( x)  for  all  x. 

p  q  p  q  p  q 


Problems 

(1.35.23)  Let  A(x)  e  M  (C(x)).  Assume  that  there  exists  an  infinite  sequence  of  distinct 

n 

points  (ck)1  such  that  A(C^)  is  diagonable,  for  k  «  1,2,...  .  Show  that  A(x)  is 
diagonable  for  all  but  a  finite  number  of  points.  (Hint  -  Consider  the  rational  canonical 
form  of  A(x)  over  the  field  of  rational  functions.). 

(1.35.24)  Let  A(  xp  .x^  )  •  x^  AQ  +  x^  A1  E  ( C  [  x^  ,  x  ^ )  ) .  Show  that  if  ACxq.x^  is 

diagonable  for  any  xq*x^  €  ®  then  (1.35.18)  holds  (m  «  1). 

(1.35.25)  Consider  the  mat  ix  (1.35.14).  Show  that  the  eigenvalues  of  A(xQ.x1)  are 

2  2  2  2 
X,  -  X,,  X2<x)  -  xn  ♦  X,.  Prove  for  x0  j  0  k{xn,x^)  is  diagonable.  As  AlO.x^)  -  x^ 

A ( x q , x ^ )  is  diaaonable  for  all  *0>x1  E  c*  Show  that  the  eigenvalues  of  A(1,x)  touch  at 


1.36.  Strict  similarity  of  pencils. 


Let  A(x)  and  R(x)  be  two  linear  pencils 


A(x)  =  A„  +  xA,  ,  B(x)  *  B  *  nB,  e  M  (C[x)) 
0  i  0  1  r\ 


Assume  that  A(x)  and  B(x)  are  strictly  similar.  That  is 


(1.36.1) 


B0  -  PA^'1.  B,  =  PA,P-’ 


for  some  non-singular  P  £  M^(C).  From  (1.22.6)  it  follows 

Lemma  1.36.2.  Let  A(x)  and  B(x)  be  two  pencils  in  Mn(C[x] )  which  are  strictly 
similar.  Then  the  three  pencils  in  (1.30.4)  are  strictly  equivalent. 


Using  the  Kronecker's  result  (Theorem  1.14.23)  we  can  determine  whether  the  pencils  in 
(1.30.4)  are  strictly  equivalent.  We  now  study  the  implications  of  the  assumption  that  the 
three  pencils  in  (1.30.4)  are  strictly  equivalent.  More  precisely  we  have 
Lemma  1.36.3.  Let  A(x)  and  B(x)  be  two  pencils  in  Mn(C[x])  such  that  the  first  two 


pencils  in  (1.30.4)  are  strictly  equivalent.  Then  there  exist  two  constant  non-zero 


matrices  U,V  e  M  (C)  such  that 
-  n  — —— 


(1.36.4) 


A(x)U  -  UB(  x)  -  0,  VA(  x)  -  B(x)V  -  0 


In  particular 


(1.36.5)  A„  ker(V),  A,  ker(V)  £  ker(V),  B0  ker(U),  B,  ker(U)  ker(U) 


Proof.  A a 


A(x)I  -  I A ( x)  -  0 


the  strict  equivalence  of  the  first  two  pencils  in  (1.30.4)  implies  that  C(A,B)  and 
C(B,A)  contains  at  least  one  non-zero  matrix  as  the  first  row  and  column  index  of  I  *  A 
Afc  ■  I  is  0.  Assume  that  £  €  ker(tJ).  That  is  U£  -  0.  From  the  first  eguality  in 
(1.36.4)  we  deduce  U(B(x)£)  -  0.  So  B(x)  ker  U  £_  ker  U  which  is  equivalent  to  the 
second  part  of  the  inequality  (1.36.5),  The  first  part  of  (1.36.5)  is  established  in  a 
similar  way. 


Theorem  1.36.6.  Let  A(x)  “  Ag  +  xA1.  B(x)  *  BQ  +  xB1,  A^,!^  £  M^IC),  1  ”  1,2  be  two 
given  pencils.  Suppose  that  either  Ag,A^  ot_  Bq,b1  do  not  have  a  common  invariant 
subspace  different  from  [0]  or  cP  (the  trivial  subspaces).  Then  A(x)  *  B(x)  if  and 
only  if  the  first  two  pencils  in  (1.30.4)  are  strictly  equivalent. 

Proof .  Assume  that  Ag  and  A^  do  not  have  in  common  non-trivial  invariant  subspace. 
Then  the  matrix  V  ¥  0  in  (1.36.4)  must  be  non-singular  in  view  of  (1.36.5).  So 
A(x)  J  B( x) .  In  case  that  Bp  and  B,  do  not  have  a  common  non-trivial  invariant 
subspace  we  get  that  I U I  ¥  0. 


A  simple  criterion  for  Ag  and  A,  not  have  a  common  non-trivial  subspace  is  that 

the  polynomial  I  Al  -  A(x)|  is  irreducible  over  C[x,M.  (Problem  1.36.17.) 

Next  we  show  the  connection  between  the  notions  of  analytic  similarity  of  matrices 

over  Hn  and  strict  similarity  of  pencils.  Let  A(x),B(x)  e  M  (H  )  and  assume  that 
u  n  P 

n(A,A>  «  1.  Suppose  that  A(x)  *  B(x).  According  to  Theorem  1.33.3  A(x)  *  B(x)  if  and 
only  if  there  exist  two  matrices  Tg.T^,  |Tg|  ¥  0  such  that 


AoTo  *  ToV  AiTo  +  AoTi  "  T0B1  *  TiBr  • 


(1.36.7) 


A,  A,\ 

F(A„,A.  )  -  (  „  .)  c  M,  <C> 


Then  (1.33.41  in  this  case  is  equivalent  to 


(1.36.8)  FtAg.A^FfTg.T,)  =  FtTp.T., )  F(Bg,  B,  )  . 

As  | F(Tg , T^ ) |  =  | TQ | 2 ,  T0  is  non-singular  if  and  only  if  F(Tg , T^ )  is  non-singular. 
Definition  1.36.9.  Let  Ai»Bi  e  PMC),  1  =  1,2.  Then  F( Ag , Aj )  and  FfBg.B,)  are  called 
strongly  similar  (F(A0,A1)  SF(B0,B,))  if  there  exists  a  non-singular  matrix  F(TQ,T.,) 
which  satisfies  (  1.36.9). 

Clearly  if  F  (  Ag ,  A  ^ )  2  F( Bg , B^ )  then  FlAg.A,)  *  FlBg.B,).  It  can  be  shown  that  the 
notion  of  the  strong  similarity  is  stronger  than  the  ordinary  notion  of  similarity. 

(Problem  1.36.24.) 

Lemma  1.36.10.  The  matrices  F( Ag, A^ )  and  F( Bg,B^ )  are  strongly  similar  If  and  only  if 
the  pencils 

A(  x)  -  F(0,I)  +  X  FtAjj.A,),  B(x)  «  F(  0 , 1 )  +  X  F<  BQ ,  B,  ) 

are  strictly  similar. 

Proof.  Let  P  »  (P.j),  P,,  e  M  (C) ,  i.j  =  1,2.  Then  F(0,I)P  =  PF(0,I)  if  and  only  if 
-  ij  ij  n 

P11  “  P22'  P21  ”  °*  That  is  p  *  F<P11'P12*  ancl  the  lemma  follows. 

Clearly  if  F(Ag,A,)  and  F(Rg,B.,)  are  strongly  similar  then  aq  *  Bg.  Without  the 
restriction  in  generality  we  may  assume  that  Ap  «  Bg.  (See  Problem  1.36.19.)  Consider 
all  matrices  Tg.T,  satisfying  (1.36.7).  For  Bg  «  Ag  (1.36.8)  reduces  to  AqTq  »  TQAg, 
AQT1  -  TjAg  «  TgB,  -  a^Tq .  According  to  Theorem  1.23.3  the  set  of  all  matrices  Tg  which 
satisfies  the  above  reguirements  is  of  the  form 

P(A,  ,F,  )  -  {Tn|Tn  c  C(A  ),  tr(V(T  B  -  AT.))  -  0, 

IT  0  0  O  0  1  10 

v  t  c(Ag)l  . 
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(1.36.11) 


Ke  also  observe 


Lemma  1.36.  1’.  Suppose  that  F(Ap,A,)  ?  PIAp.B,).  Then 


d.36.13' 


dim  P(  A^ ,  A1  )  -  dim  PtA,,?,)  -  dim  P(B1,B1» 


As  in  Theorem  1.22. <3  for  a  fixed  Ap,A5  there  exists  a  neighborhood  DlA^, 
the  first  two  equalities  in  (1.36.13)  imply  that  F(  Ap ,  A^ )  *  FfAp.A,)  for 
(Problem  1.36. IB'. 

Next  we  consider  a  splittino  result  analogous  to  Theorem  1.29. IB. 
Theorem  1.36.14.  Assume  that 


(0)  (0)  (0) 

Si  ,A22  \i  n . 


where  A*^'  and  A^'  do  not  have  a  common  eigenvalue.  Let 


A,  -  I  A 


mV  „ 

ij  y  1’  1 


,m 

1J  1 1 


be  the  conformal  partition  of  A,  and  B,  with  Ap.  Then 


(1.36.16) 


.  ( 1  >  _  ( 1  > 


(Ai.b,)  -  ha;;'.b;;')  *  ha^.s*”) 


Moreover,  F(Ap,A,)  5  F(Ap,B,)  if  and  only  if  f(A^,,a||))  S  F(  a|J  ’  ,b||  1 ) 
Proof .  According  to  Problem  (1.21.22) 


(P  )  ( 0  ) 

C(Ap)  -  C(a'm  )  #  C(Aj2')  . 


Then  The  trace  condition  in  (1.36.11)  reducer  to 


>)  such  that 
B1  t  D(A,p) 


for  i  ■  1,2. 


tr  [V  ^  (T 


(01,(1) 

1  ®11 


An,T<0!) 
*  II1  1  ' 


„  ®  )B  ( 1 ) 
V2  Tr  B22 


An>T(0)n 
*22  2  H 


0  . 


Here 


v  *  v1  ©  v2,  tq  *  T*0)  e  t20>  e  c(a^))  ®  ctA*®*) 


Choosing  either  =  0  or  V2  =  0  we  obtain  (1.36.16).  As  ! Tq |  =  |T^^|  Tq 

is  non-singular  if  and  only  if  Tg^  and  T^^  are  non_s^n<Ju^ar*  This  establishes  the 
last  claim  of  the  theorem. 

□ 

Thus,  the  classification  of  strong  similarity  classes  for  the  matrices  F  (  Ag ,  A  ^ ) 
reduces  to  the  case  where  Ag  is  nilpotent  (Problem  1.36.20).  In  case  that  Ag  =  0  the 
notion  of  the  strong  similarity  reduces  to  the  standard  notion  of  similarity.  In  case 
that  Aq  =  H(n)  the  strong  similarity  classes  of  F( Ag,  A^ )  are  classified  completely 
(Problem  1.36.23),  This  case  corresponds  to  the  case  discussed  in  Theorem  1.29.17.  The 
case  Ag  =  H(n)  *  H(n)  can  be  also  classified  completely  using  the  results  of  Problem 
1.33.20  (Problem  1.36.25). 

Problems 

(1.36.17)  Let  A(x)  e  M  (C[x] )  and  assume  that  A(x)D  c  O  where  O  is  a  subspace  of 

n 

0°,  1  5  dim  0  <  n-1.  Let  p(X,x)  e  C(X,x]  be  the  minimal  polynomial  of  the  restriction 
of  A(x)  to  D.  Thus  deg  p(X,x)  <  n-1.  Prove  that  p(X,x)  divides  I XI  -  A(x)l.  That 
is  | XI  -  A(x)|  is  reducible  over  C[X,x). 

(1.36.18)  Modify  the  proof  of  Theorem  1.22.9  to  show  that  for  fixed  Ap,A,  then  there 
exist  p  >  0  such  that  the  first  two  eoualities  in  (1.36.13)  for  B  c  P(A,p)  imply  that 
F( Ag, A, )  5  F(A0,B,). 

(1.36.19)  Prove  that  F(Ag,A,)  5  f(fg,B,)  if  and  only  if  F(Ag,A,)  5  FfPBgP'1 .PB,?"1 ) 
for  any  non-singular  P.  Suppose  that  FfAg.A,)  *  FIBg.Bj).  Show  that  it  is  possible  to 
choose  P  such  that  Ag  =  PBgP*1. 
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(1.36.20)  Prove  that  F(  Aq, A^ )  (SFIBp.B^  if  and  only  if  F(  AQ  -  XI, A^  5  FIB^  -  XI,  B1  ) 
for  any  X. 


(1.36.21)  Let  F(A0,...,AS_1)  “  (F^),  Fij  c  Mn(C)  '  *'3  ”  1,...,s,  F.^  =  A^  for 

1  <  i  <  j  4  s,  F^  *=  0  for  1  <  j  <  i  <  s.  F(  Aq,  . . . ,  As-1 )  and  F(  bq  , . . .  ,BS_, )  are 
called  strongly  similar  ( F(  Ap, . . . , Ag_^)  *  F( Bp , . . . , Bg_1 ) )  if  there  exist 
F(T0 

Prove  that  F( AQ, . . . , As-1 )  %  F(BQ , . . . , Bg_1 )  if  and  only  if  the  equalities  (1.33.4)  hold 
for  k  =  0,1,...,s-1  where  I Tq |  y  0. 

(1.36.22)  Let 


L0 . Vl>  such  that  F(R0 . Vl)F<T0 . l,.l'  -  F(T0 . Ts-1)F(B0 . Bs-1>- 


Z  =  H(n)«...«H(n) ,  X  =  (X  ) ,  Y  =  (Y  )  E  M  (C),  m  =  sn  , 

pq  pq  m 

X  -  (x!^q>),  Y  =  <y!?q>>  e  M  (C),  p,q  =  1 . .  . 

pq  ij  pq  i)  " 


Define 


A 


r 


(a(r)).  B 
pq  r 


(b(r>)  e  M  (C) 
pq  S 


r+1 

l 


i=1 


<pq) 

(n-r+i-1)i' 


b(r> 

pq 


r+1 

V 

L 

i-1 


jpq> 

y(n-r+i-1 


)i ' 


r 


0,...,n-1  . 


Using  Theorem  1.21.9  prove  that  F(Z,X)  S  F(Z,Y)  if  and  only  if  F(  Ap , . . . ,  A,,., )  5 
F(B0, . . .  ,Bn_1 ) .  (To  do  that  one  needs  the  following  auxiliary  result.  Consider  X  =  (Xpq) 
of  the  above  form.  Assume  that  each  Xpq  is  an  upper  triangular  matrix.  Expand  the 
determinant  of  X  by  the  rows  n,2n,...,sn  and  use  the  '  iduction  to  show 


I  X] 


n 

n 

r«1 


(x<pq))  .1). 

rr  p,q«1 


(1.36.23)  Use  the  two  preceding  problems  to  prove  that  F(H(n),X)*  F(H(n),Y),  X  «  (x^), 

Y  »  (y ,  . )  e  M  (C)  if  and  only  if 
il  n 
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(1.36.24)  Let  X  =  (x  )  e  M2<C).  Prove  that  if  x21  ¥  0  then  F(H(2),X)  =  H(4). 
Combine  this  result  with  Problem  1.36.23  to  show  the  existence  of  Y  e  M2(C)  such  that 
F(H(2),X>  .  F(H ( 2 )Y )  but  F(H(2),X)  %  F(H(2) ,Y) . 

(1.36.25)  Assume  in  Problem  (1.36.22)  s  =  2.  Let 


n-1  n-1 

A(x!  =  l  A  x1,  B(X)  =  T  B.x1  e  M  (H  )  . 

i=0  1  i=0  1 


Using  the  results  of  Problems  (1.36.21)  -  (1.36.22),  Section  1.33.3  and  Problem  1.33.20 
prove  that  F(Z,X)  5  F(Z,X)  if  and  only  if  the  three  matrices  in  (1.30.4)  have  the  same 
local  invariant  polynomials  up  to  the  degree  n-1. 
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I. 37.  Note? 

Vest  of  rue  material  in  Sections  1.1  -  1.8  is  standard.  See  Lang  [1967)  and  van  der 
Waerder.  [1959]  for  the  algebraic  concepts.  Consult  Gunning-Rossi  [1965]  and  Rudin  [1974] 
for  the  material  concerning  the  analytic  functions.  See  Kaplansky  [1949]  for  the 
properties  of  elementary  divisor  domains.  It  is  an  open  problem  whether  there  exists  a 
Bezout  domain  which  is  not  an  elementary  divisor  domain.  Theorem  1.5.6  for  ft  •  C  is  due 
to  Helmer  [1940].  A  nice  introduction  to  the  theory  of  algebraic  varieties  can  be  found  in 
Lange  [1958] . 

Section  1.9  is  standard,  e.g.  Curtis  and  Reiner  [1962]  and  MacDuffee  [1933].  Most  of 
the  content  of  Section  1.10  is  well  know,  e.g.  MacDuffee  [1933].  Perhaps  Lemma  1.10.3  is 
not  common.  The  content  of  Section  1.11  seems  to  be  new  since  the  underlying  ring  is 
assumed  to  be  only  a  Bezout  domain.  In  case  that  the  underlying  is  BDD,  i.e.,  A  is 
eouivalent  to  a  diagonal  matrix.  Theorems  1.11.7  and  1.11.12  are  well  know.  It  would  be 
interesting  to  generalize  Theorem  1.11.7  for  D  =  F[x,,,...,x  ],  for  p  >  2.  The  fact  that 
the  Smith  normal  form  can  be  achieved  for  the  elementary  divisor  domain  i3  due  to  Helmer 
[1943].  Consult  also  Kaplansky  [1949]. 

Most  of  the  results  of  Section  1.13  are  from  Friedland  [1979b].  It  is  nn  open  problem 

whether  the  results  of  Problem  1.13.26  hold  for  any  8  c  f .  In  case  that  ft  «  D(0,p), 

G  2 

{ C  I  C  "  (  5 1  <  •  •  •  <Cp)  <  I  Ujl  <  p)  the  results  of  Problem  1.13.26  apply.  This  follows 
from  the  Cartan  theorem  b,  e.g.  Gunning  and  Rossi  [1965].  This  result  is  due  to 

J.  Mather  (unpublished). 

The  exposition  of  Section  1.14  is  close  to  Gantmacher  [1959).  The  content  of  Section 
1.15  is  standard.  Theorem  1.16.7  is  well  known  (e.g.  Gantmacher  [1959]).  Other  results  of 
Section  1.16  are  not  common  and  some  of  them  may  be  new.  Section  1.17  is  standard  and  its 
exposition  is  close  to  Gantmacher  [1959].  Theorem  1.18.5  is  probably  know  of  BDD  (see 
Leavitt  [1948]  for  the  case  n  =  H(ft),  ft  c  C) .  Perhaps  it  is  new  for  Bezout  domains.  The 
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results  of  Section  1.19  are  standard.  Theorem  1.20.10  appears  implicitly  in  Friedland 
[1979b].  The  exposition  of  Section  1.21  is  close  to  Gantmacher  [1959].  For  additional 
properties  of  the  tensor  product  of  matrices  see,  for  example,  Marcus  and  Mine  [1964]. 
Problem  1.21.2R  is  close  to  the  results  of  Faddeev  [1066]  for  necessary  and  sufficient 
conditions  For  the  similarity  of  A  and  B  over  Z.  See  also  Guralnick  [1980]  for  a 
arbitrary  integral  domain  D.  The  results  of  Section  1.22  are  recent.  Theorems  1.22.3  and 
1.22.7  are  taken  from  Friedland  [1979b].  See  Gauger  and  Byrnes  [1977]  for  a  weaker  version 
of  Theorem  1.22.7.  Some  of  the  results  of  Section  1.23  seem  to  be  new.  Theorem  1.23.3  was 
taken  out  of  Friedland  (1979a).  Theorem  1.24.1  is  due  to  Friedland  [1979b). 

The  exposition  of  Section  1.25  is  close  to  Gantmacher  [1959].  The  results  of  Section 
1.26  were  inspired  by  the  paper  of  Rothblum  (1980).  The  notions  of  local  indices  can  be 
found  in  Friedland-Schneider  [1980].  The  content  of  Section  1.27  is  standard.  Theorem 
1.27.9  can  be  found  for  example  in  Wielandt  [1967]  and  Problem  1.27.12  in  Gantmacher 
[1959].  The  use  of  the  Cauchy  integration  formula  to  study  the  properties  of  the  analytic 
functions  of  A  is  well  accepted.  See  for  example  Kato  [1976].  The  results  of  Section 
1.29  are  due  to  Arnold  [1971],  See  also  Wasow  [1977].  See  Wasow  [1963],  [1977]  and  (1978] 

for  the  notions  of  analytic  and  pointwise  similarity  and  their  importance  in  theory  of 
differential  equations  in  the  neighborhood  of  singularities.  Theorem  1.30.8  in  case  of  one 
complex  variable  appears  in  Friedland  [1979b).  Corollary  1.30.11  goes  back  to  Wasow 
[1963].  Theorem  1.31.1  for  simply  connected  domains  is  due  to  Gingold  [1978].  See  Wasow 
[1978]  for  the  extension  of  Theorem  1.31.1  to  certain  domains  SI  ^  .  It  is  shown  there 

that  Theorem  1.31.1  fails  even  for  some  simply  connected  domains  in  C^. 

Theorem  1.32.1  can  be  found  in  Kato  [1976]  or  Friedland  [1978].  The  results  of 
Sections  1.33  -  1.34  were  taken  from  Friedland  [1n79b).  It  is  worthwhile  to  mention  the 
conjecture  stated  there  that  A(x)  and  B(x)  are  analytically  similar  over  if  the 

three  matrices  in  1.30.4  are  equivalent  over  Hp.  Theorem  1.35.11  is  new.  Theorem  1.35.17 
is  taken  form  Friedland  (1980).  Theorem  1.35.21  and  Problem  1.35.24  are  due  to  «otzkin- 
Taussky  [1955].  See  also  Kato  (1976)  for  a  proof  of  these  results  using  the  method  of 
analytic  functions.  Most  of  the  results  of  Section  1.36  are  taken  from  Friedland 


(1979a-bl.  Some  results  and  references  on  the  problem  of  strict  similarity  of  pairs 
(A,B)  of  matrices  under  the  simultaneous  similarity  can  be  found  in  Brenner  [19751.  See 
also  Procesi  for  the  extensive  treatise  on  the  invariants  of  pairs  ( A, B)  under  the  strict 
similarity. 
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